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DNA 

This invention relates to DNA comprising a nucleo- 
tide sequence coding for at least a portion of a polypept- 
ide which is a plant toxin of the ricin type, as herein- 
after defined- It also relates to recombinant DNA 
molecules comprising a DNA sequence which codes for a 
polypeptide which is or is closely related to a natural 
plant toxin of the ricin type. Ricin, and also other 
plant toxins such as abrin, modeccin and viscumin, 
consist of two polypeptide chains (known as the A and B 
chains) linked by a disulphide bridge, one chain (the A 
chain) being primarily responsible for the cytotoxic 
and the other chain (the B chain) having sites enabling 
the molecule to bind to cell surfaces. Ricin is produced 
in the plant Ricinis communis (also known as the castor 
bean plant ) via a precursor protein known as 
"preproricin 11 . 

Preproricin comprises a single polypeptide chain 
which includes a leader sequence. The leader sequence 
is subsequently removed in the organism to give proricin 
which is then cleaved to eliminate a linker region and 
joined by a disulphide bond to from the mature protein. 
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The toxicity of ricin-type toxins operat s in three 
phases: (l) binding to the cell surface via the B 
chain; (2) penetration of at least the A chain into' 
the cytosol, and (3) inhibition of protein synthesis 
through the A chain attacking the 60S subunits of the 
ribosomes. Thus, separated A and B chains are 
essentially non-toxic, the inherently toxic A chain 
lacking the ability to bind to cell surfaces in the 
absence of the B chain. 

It is also known that in ricin-type toxins the B 
chain binds to cell surfaces by virtue of galactose 
recognition sites, which react with glycoproteins or 
glycolipids exposed at the cell surface. 

It has already been suggested that the toxicity 
of the ricin A chain might be exploited in anti-tumour 
therapy, by replacing the indiscriminately-binding B 
chain with a different carrier component having the 
ability to bind only to tumour cells. Thus, various 
immunotoxins have already been prepared, consisting 
of a conjugate of whole ricin or a separated natural 
ricin A chain and a tumour-specific monoclonal antibody. 
Although these known conjugates are of considerable 
potential in themselves, there is scope for improvement. 

One problem with the known conjugates arises from 
a structural feature of the A chain from natural ricin. 
It is known that the natural ricin A chain becomes N- 
glycosylated during its synthesis, by enzymes present 
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in Ricinus c lis, and it is thought that the resulting 
sugar moieties are capable of non-sp cific int ractions 
with cell surfaces. Thus, it appears that the known 
A chain conjugates are capable of a certain amount of 
binding with non target cells, even in the absence of 
the natural B chain, thus increasing the toxicity of 
such immunotoxins towards non target cells. 

Another problem with the known ricin A chain 
conjugates stems from the fact that the B chain seems 
to have an important secondary function in that it some- 
how assists in the intoxication process, apart from 
its primary function in binding the ricin molecule to 
the cell surface. This secondary function is lost 
if the B chain is replaced by a different carrier component 
such as a monoclonal antibody. 

If it were possible to prevent interactions between 
the cell surface via the A chain sugar moieties, whilst 
preserving the secondary toxicity-increasing function 
of the B chain, the toxicity of a whole ricin antibody 
conjugate towards normal cells could be reduced, and 
towards target cells could be increased, thus improving 
the therapeutic index of the immunotoxin. It is also 
known that the natural ricin B chain is N-glycosylate d 
and the B chain sugar moieties may also contribute to 
non specific interactions, Also, the sugar moieties 
in both chains enable the ricin molecule to be sequest- 
rated by reticuloendothelial cells in the liver, and so 
would lead to the 
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rapid xcretion from th system f a drug bas d on a 
part r th whol of th ricin molecul in which such 
sugar moieties were still present. 

Attempts to remove all the sugar moieties from 
natural ricin by chemical or enzymatic methods have 
so far failed. Nevertheless the major obstacle con- 
fronting the use of known whole ricin-antibody conjugates 
is the presence of two galactose binding sites in the 
ricin B chain. These B chain galactose binding sites 
are primarily responsible for the non-specific cellular 
interactions of current whole ricin-antibody conjugates, 
particularly when used in vivo . Their presence in 
the natural toxin clearly eliminates or reduces the 
targeting specif ity conferred by the antibody. 

An improved immunotoxin based on ricin or another 
plant toxin of the ricin type, not suffering from these 
problems, could consist of a whole toxin molecule 
modified so that it is not N-glycosylated, and so that 
the B chain has no galactose recognition sites, but 
retains its secondary intoxication-promoting properties, 
coupled to a carrier moeity which delivers the toxin 
to the target cells. This could be a tumour-specific 
or cell /tissue specific vehicle such as a suitable mono- 
clonal antibody. 

Our research which has so far been concentrated 
on ricin itself, has indicated that the assembly of 
ricin (and the related agglutinin which consists of 
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two ricin-like molecules with slightly modifi d A and 
B chains) do s n t involve th separate synthesis of 
th A and B chains as th products of distinct mRNA »s, 

but rather the initial formation of a single 
polypeptide precursor containing both the A chain and 
B chain sequences. This is thought to apply in the 
case of other toxins of the same type. 

This invention is based on the idea of preparing 
a genetically-engineered microorganism capable of express- 
ing a molecule of a toxin of the ricin type, as defined 
above, 

or alternatively part of such a toxin molecule 
or a precursor of such a molecule (which could be 
converted to the toxin molecule itself) which toxin 
molecule could be modified as suggested above and could 
be used to construct an effective toxin conjugate by 
combining it with a tumour-specific or cell /tissue 
specific monoclonal antibody or other carrier moiety, 
such as a hormone or lectin. 

The fact that ricin is formed via a precursor 
polypeptide, will enable a cell system to be constructed 
by known techniques which expresses a ricin precursor. 
The ricin precursor product could then be chemically 
or enzymatically converted to the desired modified ricin. 
An analagous technique could be used in the case of 
other ricin-type toxins as herein defined. An alternative 
technique would be to divide from the DNA sequence that codes 
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for the precursor two 6 quences which code separately 

f r the A and B chains, t insert these s parated 
s quenc s int different cl ning vehicles and to insert 
the resulting recombinant DKA molecules into separate 
host microorganisms. One such host would then express 
the A chain polypeptide sequence and the other the B 
chain polypeptide sequence. These sequences could 
then be combined to form the desired modified ricin 
molecule. This technique could obviously also be used 
for any ricin-type toxins in which the A and B chains 
are encoded by distinct mRNA gene pools. This latter 
approach would be preferred on safety grounds, in that 
separate and therefore non-toxic A and B chains would 
be expressed. 

According to one aspect of the invention we provide 
a biologically pure and homogeneous sample of DKA com- 
prising a nucleotide sequence coding for at least a 
portion of a precursor of a ricin-type toxin polypeptide, 
or mutants thereof. 

Said portion preferably comprises the A chain 
or the B chain of the mature protein. 

More specifically, we provide a sample of 
DNA including at least a substantial portion of any 
of the following DNA sequences, which sample is 
biologically-pure : 
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ATG 1AT GCA GIG GCA AC A 3GG CI'I 



♦ • • • » • 

TGI 1T7 GGA 7CC ACC 1CA GGG 1GG 3C1 TIC ACA 71A GAG GA1 AAC AAC AT A 



IK CCC AAA CAA 1AC CCA ATT ATA AAC 111 ACC ACA GCG GG1 GCC A C1 GIG 



• • • • • 

CAA AGC. TAC ACA AAC TTT ATC AGA OCT GTT CGC GGT CGT TTA ACA AC! GGA 



* • m • m 

GC1 GA1 GIG AGA CA3 GA1 ATA CCA GIG 31G CCA AAC AGA GT1 GG3 31G CC1 



• • • • 

A1A AAC CAA CGG TIT ATI IT A Gil GAA C1C 1CA AA1 CA1 GCA GAG C33 It 1 



Gil ACA 31 A GCC CTC, GA3 G1C ACC A A3 (VA 3A3 GIG G1C UX 3 AC (CI (-"1 



GGA AA1 AGC GCA 1 A3 IK 311 CAl CC1 GAC AA1 CAG GAA GA1 GCA GAA GCA 



ATC ACT CAl Cll TIC ACT GA1 GT1 CAA k\1 CG^ 1AT ACA lit GCC TO GG1 

Page 7a follows... 
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GG1 AA7 7 AT GAT AGA CTT 6AA CAA C7T GCI GCT AA1 CTG AGA GAA AAT Alt 



GAG TIG GGA AA1 GGT CCA CTA GAG GAG GCT Alt 7CA GCG CT7 7 AT 7A7 7 AC 



ACT ACT GGT GGC ACT CAG CTT CCA ACI CTG GCT CCT TCC TTT ATA ATT TGC 



ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 



5 CGC ACG AGA ATT AGG TAC AAC CGG AGA Id GCA CCA GA7 CCT AGC G1A All 



•AC A C1T GAG AA1 ACT 7GG GGG AGA CT1 ICC ACT GCA ATT CAA GAG 7C7 AAC 



CAA GGA GCC TTT GCI ACT CCA ATT CAA CTG CAA AGA CCT AA7 GCT TCC AAA 



7TC ACT GTG TAC GAT GTG ACT ATA TTA A1C CC7 A1C A1A GCI CTC ATG GIG 



• • • • • 

TA1 AGA 7GC GCA CCT CCA CCA TCG TCA CAG TTT TCI 1TG CTT ATA AGG CCA 

Page 7b follows... 
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m # • • • • 

GTG GTA CCA AAT TIT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA G1C 

• • • 

CGT ATC GTA GOT CGA AAT GGT CIA TGI GTT GAT GTT AGG GAT GGA AGA TIC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 

• • m m m 

Ml CAG Clr TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 

• • • • 

TGI TTA ACT ACT TAC GGG TAC AGT CCG GGAGTC TAJ GTG ATG ATC TAT CA1 
TGC AAT ACT GC7 GCA ACT GAT GCC ACC CGC TGG C.AA ATA TGG GAT 

* • • • • • 

AA1 GGA ACC ATC ATA Ml CCC AGA TCT AGT CIA GTT TTA GCA GCG ACA TCA 
GGC AAC AGT GGT ACC ACA CIT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 

• • • • • 

CAA GGT TGG CTT CCT ACT AAT AAT ACA CAA CCT TTT GTT ACA ACC ATT GTT 

Page 7c follows... 
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GGG CTA TAT GCT CTG TGC TTG CAA GCA AAT ACT GGA CAA GTA TGG ATA GAG 



' GAC TCT AGC ACT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GCT 



TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA ACT GAT 1CT AAT 



ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TCT GGC CCT GCA TCC TCT GGC 



• • • • 

5. CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAl TTG 1AT ACT GGA 



• • • • • 

TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 



1AC CC1 CTC CAT GCT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 



or 
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ATC TAT GCA "GTG GCA ACA TGG CTT 

• • • • • • 

TGT TIT GGA TCC ACC TCA GGG 1GG TCI 1TC ACA TTA GAG GAT AAC AAC ATA 

# 

TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGT GCC ACT GTG 

• • • • • 

CAA AX TAC ACA AAC TTT ATC AGA OCT GTT CQC GGT CGT TTA ACA ACT GGA 
GCT GAT GTG AGA CAT GAT ATA CCA GTG TTG CCA AAC AGA GTT GGT TTG CCT 

„ • • • • • 

5 * ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT 1CI 

GTT ACA TTA GCC CTG GA1 GTC ACC AAT GCA TAT GTG GTC GGC TAC CGJ GC1 

GGA AA1 AGC GCA TAT TTC TTT CAT CC1 GAC AA1 CAG GAA GAT GCA GAA GCA 

ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 

Page 8a follows... 
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GGT AM TAT GAT AGA CTT GAA CAA CTI GCI GGT AAT CTG AG* GAA AAT ATC 



GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATCTCA GCG CTT TAT TAT TAC 



ACT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT ICC TTT ATA ATT TGC 



ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATC 



5. CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCI AGC GTA ATT 



ACA CTT GAG AAT ACT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 



CAA GGA GCC TTT GCT 'ACT CCA ATT CAA CTG CAA AGA CGT AAT GG1 TCC AAA 



TTt ACT GTG TAC GAT CTG ACT ATA TTA ATCCCT ATC ATA GCT CTC ATG GTG 



TAT AGA TGC GCA CCT CCA CCA TCG TCACAG ITT 
or Page 8b f Hows 
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GCT GAT GTJ I'd ATG GAT CCT GAG CCC ATA GTG 
CGI ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 

• • • 

CAC AAC GGA AAC GCA ATA GAG TTC TGG CCA TGC AAG TCT AAT ACA GAT GCA 

AAT CAG CTr TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 

TGJ TTA ACT ACT TAC GGG TAC ACT CCG GGAGTC TAT GTG ATG ATC 1AT GAT 

TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 

AA1 GGA ACC ATC ATA AAT CCC AGA 1CT AGT CTA Gn TTA GCA GCG ACA TCA 

GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGI 

CAA GGT TGG CTT CCT ACT AAT AAT ACA CAA CCT TTT GTT ACA ACC ATT GTT 

Page 8c follows... 
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GGG CIA TA1 GGT C1G 7GC TTG CAA GCA AAT ACT GGA CAA CIA 1GG ATA G*G 

G4C TCT AGC ACT GAA AAG GCT GAA CAA CAG TGG OCT CTT TAT GCA GAT GGT 

TCA ATA CCT CCI CAG CAA AAC CGA GAT AAT TGC CTT ACA ACT Gtf 1CT AAT 

ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 

5 -CAA CG* TGG ATG TTC AAG AAT GAT GGA ACC AIT TIA AAl TTG 1 Al ACT GGA 

• • • • • 

TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC An CTT 

• • • • 

TAC CCT CTC CAT GCT GAC CCA AAC CA* ATA 3GG TTA CCA TTA TTT 
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or at least a portion of a nucleotid sequence which 
is equivalent thereto by virtue of degeneracy of 
the genetic code. 

According to another aspect of this invention 
5 we provide a recombinant DKA molecule comprising 

a DNA sequence which codes for a polypeptide sequence 
present in a plant toxin of the ricin type, as defined 
herein . 

More specifically, we provide a recombinant DNA 
0 molecule containing a DNA sequence which codes for 
a 
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A and B chain precursor polypeptide of a plant toxin 
of the ricin typ . 

Alternatively, we provide a recombinant DKA 
molecule containing a DNA sequence coding for at least 
a portion of either the A chain or the B chain of a 
plant toxin of the ricin type. 

According to a further aspect of the present 
invention we provide a genetically modified host micro- 
organism which contains a recombinant DKA molecule as 
hereinbefore defined. 

In the recombinant DNA molecules according to 
the present invention, the nucleotide sequence coding 
for the B chain may be modified to eliminate or inactivate 
the galactose binding sites, and the precursor polypeptide 
and hence in the mature protein, to eliminate or inactivate 
the galactose binding sites, and the sequences encoding 
ther signals for N-glycosylation may also be modified 
to render them ineffective or to eliminate them. Examples 
of techniques which may prove useful are deletion or 
oligonucleotide mediated mutagenesis. 

The host organism may be a plant cell or an 
animal cell or preferably a microorganism. 

The microorganism may be a prokaryote or a 
eukaryote. As examples of prokaryotes may be mentioned 
Gram-negative bacteria, e.g. E.coli. ^ Methylophilug 
methylotrophus and Alcaligenes eutrophus; and Gram-positiv 
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bact ria, e.g. Streptomyces, Bacillus subtilis and 
Arthrobacter . As examples f eukaryot s may be mentioned 
yeasts, for example Saccharomyces cerevisiae . 

The recombinant DNA molecules may comprise a cloning 
5 vector such as a plasmid or phage vector into which 
has been inserted the DNA sequence coding for at least 
a portion of a precursor polypeptide, or at least a 
portion of either the A chain or the B chain, of a 
ricin-type plant toxin. 

10 The cloning vector is preferably a plasmid although 
we do not exclude the possibility that it may be a 
phage vector. The plasmid may be a naturally-occurring 
plasmid or preferably a composite derived from fragments 
of other plasmids. Where a composite plasmid is 

15 employed it preferably contains promoter sequences which 
improve expression of the ricin gene. 

Examples of suitable plasmids which may be used 
as cloning vehicles are inter ali a for Gram-negative 
bacteria: pBR322, pAT153, pUC8, pGSSIS and pMB9: for 

20 Gram-positive bacteria: pVC6: and for S. cerevisiae : 

PMA91, pMA230, YRp7, pLC544 and YEp6. The vector will be 
selected to be suitable for the particular host envisaged. 

We also provide a method of obtaining a recombinant DNA 
molecule which comprises preparing a double-stranded DNA 

25 sequence which codes for a polypeptide sequence present 
in a plant toxin of the ricin type and inserting said 
double-stranded DNA sequence into a cloning vector. 
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More specifically such a in thod may comprise 
isolating the mRNA which codes for the ricin A and B 
chain precursor polypeptide, synthesising single- 
stranded cDNA from said mRNA using reverse transcriptase 
and a suitable primer, assembling the second DNA strand 
onto the template formed by said first strand by means 
of DNA polymerase followed by SI nuclease, and inserting 
the resulting double-stranded cDNA into a cloning vector* 

Alternatively, the cDNA .assembled from the mRNA 
may be cut into separate portions which code respectively 
for separate portions of the ricin molecule, for example 
for the A anu B chains, which portions are then inserted 
inuo separate cloning vectors. 

As stated above the cloning vector is preferably 
a plasmid such as pBR322, pAT153, of pUC8, and this 
may be cut open by means of the restriction endonuclease 
Pst I, and may • be tailed with oligo (dG), 

and annealed with the double-stranded cDNA which has 
been tailed with oligo (dC). 

We also provide a method of producing a modified 
transformed host by introducing into a suitable host 
microorganism a recombinant DNA 

molecule according to this invention. 

The microorganism used as the host for cloning 
is preferably a Gram-negative bacterium and more preferably 
E . col i . 

After cloning, -the DNA sequence coding for the 
ricin precursor (or the precursor f anoth r ricin-type 
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toxin which is f rmed from a precursor) may be removed 
from the host cloning vector. It may then be divided 
into two portions which code for separate regions of 
the toxin molecule, for example the A and B chains , 
5 these portions introduced into separate second cloning 

vectors and new hosts modified with each of the resulting 

new recombinant DNA molecules. Alternatively it may 

be introduced whole into a second cloning vector. 

The second cloning vectors comprise suitable promoter 

10 sequences and the position and direction of insertion 
of the whole coding sequence or portion thereof into 
the second cloning vector are such that on introducing 
the new recombinant DNA molecules into suitable host 
microorganisms, e.g. L. coli or S. cerevisiae ? expression 

*5 of the desired gene sequence is obtained. 

The preparation of a transformed host containing 
a DNA sequence which codes for '„he ricin A and B chain 
precursor polypeptide will now be described by way of 
example, first in general terms and then in detail. 

20 This process is summarised in the accompanying diagram. 

Firstly, the mRNA encoding this precursor was 
enriched in known manner by sucrose density gradient 
centrifugation. The corresponding cDNA was assembled 
in single-strand form onto this mRNA in known manner 

25 using the enzyme reverse transcriptase, a growing point 
having first been provided on the mRNA using oligo (dT) 
as primer which binds onto the polyadenylated 3' -terminus 
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f th mRNA. The imm diate product of this reaction 
is a DNA-RNA hybrid. The RNA strand is removed by 
hydrolysis, leaving the single-strand DNA intact* 
This is converted to the double stranded form using 
5 the enzyme DNA polymerase in the presence of free 

nucleotides, which results in a hairpin-shaped molecule, 
the curved end of which is then removed by the single- 
strand specific nuclease Si. The resulting double 

using terminal transferase 
strand cDNA is then tailed with oligo (dC)/, size fraction- 

10 ated to remove small molecules, or vice versa and annealed 

with the pBR322 or pAT153 vector which has been cut 

. . . _ A , . . „ using terminal transferase 

open with Pst 3 and taxied with oligo (dG)/, the cytosine 

tails on the DKA base pairing with the guanine tails 
on the vector. 

35 The resulting chimaeric plasmids containing the 

DKA segment coding for the ricin precursor polypeptide 
were then used to transform E-coli DH1 cells, and 
the presence of the chimaeric plasmid was ensured 
by selecting cells displa}*ing -tetracycline resistance 

20 and ampicillin sensitivity. Over 1600 Tet r , Amp S 

clones were obtained. Colonies derived from each 

clone were transferred to nitrocellulose filters 

and clones containing the desired DNA sequence 

identified using a 32P-end labelled 20 mer oligonucleo- 

25 tide probe, having the DKA sequence ACCTACAA^Tt£tT^CT^CC 

G T G G 

which hybridises to DKA containing the complementary 
sequence TGGATGTt£aA^Aa£ga£gG. As the ricin precursor 
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polypeptide have been found "to contain "the amino 
sequence - Trp-Met-Fhe-Lys-Asn-Asp-Gly- the DKA 

sequence responsible for this is known from the 

genetic code to be the latter mentioned above. 
» - 
Using appropriate hybridisation and wash conditions, 

e.g. 25 described by Singer-Sam ct al in (3 983) 
Proc. Natl- Acad. Sci. (U.S.A. ), Vol.80 pp 802-806, 
80 clones were selected as positive^' containing 
the desired DKA sequence, and of these, the eight 
largest in the plasmid pBR322 have been initially 
chosen for further characterisation. Their relation- 
ship to the castor bean lectin precursor polypeptides 
has been confirmed using the hybrid release translation 
assay. Of the eight clones mentioned above, four % 

respectively with 1614, 1950; 1059 ™d 1020 oase pairs, 
have been selected for sequencing. 

In detail, the transformed host vas prepared 
as f ollows : 

A. cDKA synthesis 

1. mftKA extraction and fractionation , 

100-200 g of ripening Ricinus &eeds were frozen auu ground 
to a powder in liquid nitrogen, and homogenised 
in a Varing blender for 1-2 minutes in 50 m>5 tris- 
HC1 ph 9, 150 m>5 KaCl, 5 mM EOT A and S% SDS. The 
homogenate was extracted with an equal volume of 
phenol : chloroform (1:1) and the phases were separated 
by centrifugation. The rganic phase and residue 
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were reextracted with 0.5 volume f 20 mM tris-.HCl 
pH 9.0, 2 mM EDTA and the resultant aque us phase 

was combined with the original one. The total 
aqueous phase was reextracted repeatedly with' equal volumes 
5 of phenol: chloroform until np material was present 
at the interface. RRA was precipitated, by the 
addition of 2 volumes of cold ethanol after 'making 
( the solution 200 mM in KaCl. 

After overnight precipitation at - 20°C the RKA 
10 vas centrifuged at 3 0,000 rpm for 30 minutes in 
an MSE 3 8 or MSE 21 centrifuge; the pellet was 
then washed repeatedly in 3 M KaAc pH 5.5 until 
no polysaccharide was detectable in the supernatant 
by ethanol precipitation. The final pellet was 
J 5 dissolved in 3 00 mM KaCl and precipitated as above. 

mRKA molecules bearing poly(A) tails were extracted 
by affinity chromatography on oli go (dT) -cellulose: 
after hybridisation at room temperature for 30 min. 
in 400 mM KaCl , 20 mM tris-HCl pH 7.6, 0.2* SDS, 
20 the beads were pelleted and washed three times in 
the above buffer and two times in 200 mM KaCl, 
20 mM tris-HCl pH 7 .6, 0./1JS SDS. The slurry was 
poured into a column and washed further with the 
3ast buffer until the A 26o of the eluate reached 
25 the background level. Poly (A) -containing RNA was 
then eluted with 20 mM tris-HCl pH 7.6 at 50°C. 
The eluate was monitored with an 1SC0 continuous 
flow UV cell. Poly(A)-containing RNA was precipitated 
overnight from 200 mM KaCl by the addition of 2 " 
30 volumes of cold ethanol at - 20°C and was then washed 
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thre times with 70% ethanol,. and redissolved in 
10 mM tris-HCl pH 7.0 to approximately 1 pg/pl. 

The mRNA was heated for 2 rain, at 65° C' and quench 
cooled. Approximately 400 pg of poly (A) 4 RKA was 
layered on top of a 3 0 - 30% ribonuclease-f ree sucrose 
(Sigma) density gradient in 100 mM tris-HCl pH 7.5, 
0.5? SDS, J mM EDTA, and centrifuged in a Beckman 
15-65B centrifuge using an SW27'rotor at 25,000 
rpm at 17°C for 14 hours. '400.pl factions were 
collected with an 1SC0 density gradient fractionator 
using the continuous flow UV cell. 

Each fraction was made 200 mM in KaCl and precipitated 
vith 2 volumes of cold ethanol by freeze -thawing 
in liquid nitrogen three times, and recovered by 
centrifugation in an Eppendorf microcentrifuge for 
30 minutes at 4° C, vashed once with 70* ethanol, 

and redissolved in 3 0 ul of 30 mM tris-h*C3 pH 7.0. • 
An aliquot from each fraction (l ul ) was translated 
in a reticulocyte lysate cell free system and the 
lectin precursor immunoprecipitated to identify 
the fraction enriched for lectin mRNA. 

2. first strand synthesis 

Fractionated poly(A) 4 RKA was reverse transcribed 
at 0.5 pg/yl in the presence of 50 mM tris-HCl pH. 
B.3, 30 mM MgC3 2 , 100 mM KC3, 3 mM of dA TP, dTTP 
and dGTP, 250 dCTP, 0.06 pg/ul olig (dT^ gf 
30 mM DTT and 0.4 units/^l of reverse transcriptase 
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fr m avian myeloblast sis virus. (3H)dCTP or 

a-( P)dCTP were included in the reaction as appropriate. 

The reaction mixture was incubated at 42 C for 45 

minutes, at vhich point an equal volume of 5 mM 

5 tris-HCl pH 8.3, 5 mM DTI, 250 jiM dCTP vas added 

along with the same amount of enzyme as previously. 

The reaction vas incubated for a further 45 minutes 

at 4 5°C and terminated by freezing. Aliquots were 

analysed on 1% denaturing agarose gels along with 

3 0 the products of the second; strand and Sj nuclease 

reactions. 

3- Second strand sj'nthesis 

mRNA-cDNA hybrids were denatured by boiling the 
first strand reaction for 3 minutes and cooling 

3 5 rapidly. After pelleting insoluble material in 

the Eppendorf roicrof uge for 2 minutes -the supernatant 
was transferred to a fresh chilled tube. For the 
standard reaction, reagents were added as follows, 
ignoring elements already present: dATP, dGTP and 

20 dTTP to 100 pM, Hepes-KOH pH 6.9 to 305 mM, KCl 

to 92 mM, dCTP, labelled as appropriate, -to 80 jiM, 
and 0.3 units/jil of DKA polymerase. The reaction 
vas allowed to proceed at 20°C for 6 hours, at which 
time cDRA was removed from the mixture by gel filtration 

25 on 3 ml columns of Bio-Gel P60 in 3 0 mM tris-HCl 
pH 7.6, 20 mM KaC3, 3 mM EDTA. Fracti ns were 
monitored by Cerenkov or liquid scintillation counting, 
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and peak /excluded fractions were pooled and 
precipitated from 0.3 M XaAc'pH 6 by iJhe addition 
f 2 volumes of cold ethanol. Precipitates were 
recovered by centrif ugati on in the Eppendorf micro- 
centrifuge for 30 minutes in the cold, and dissolved 
in water -to about 2.5 jJg/ul of RKA - equivalent 
material • 

4. Sj nuclease digestion 

Single-stranded regions of double-stranded cDNA 
were digested with nuclease from Aspergillus 
oryxae , in the presence -of 300 roM KaCl, 30 mM 
KaAc pH 4-5, 3 m>J ZnCL 2 - The reaction was incubated 
for 15 minutes at 37°C and then for 3 5 minutes at 
15°C a and was terminated by -the addition of -tris- 
HC3 pH 7-6 to 3 30 mM and EDTA to 1 0 mM; it was 
then extracted with an equal volume of phenol : chloroform 
isoamyl alcohol (25:24:3) and precipitated from 
300 mM KaAc pH 6 with 2 volumes of cold ethanol. 
The precipitate was dissolved in 10 uiM tris-HCl 
pH B, 0.3 mM EDTA to 0.25 pg/yl KNA equivalent. 

5- Addition of homopolymer tails to DKA 

Double-stranded DNA was tailed using terminal 
transferase with dCTP at 0.001 - 0.01 jjg/yl in the 
presence of 140 mM potassium cacodylate pH 7.6, 
30 mM tris base, 0.1 mM DTT, 
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3 mM CoCJ 2 and ( 3 H) or (32p) _ labelled dCTp in 
75 - 350 -fold excess over 3' termini. The reaction 
vas carried out at 3? C for 6 minutes. The extent 
of incorporation of label vas followed "by assaying 
5 the amount of TCA - insoluble radioactivity as a 
proportion of the total radioactivity, counting 
in Bray's "scintillant . 

The reaction vas stopped by chilling and adding 
EDTA to 3 0 mM, after vhich unincorporated material 
10 was removed by gel filtration as described. Tailed 
cDNA was precipitated as before, and dissolved in 
3 M KaAc pH 8, 10 mM tris-acetate pH 8, 1 mM EDTA 

ready for fractionation. 

Pstl-cleaved pBR322DNA vas similarly treated, 
except that dGTP replaced dCTP. 

6. Fractionation of tailed cDKA 

cDNA was fractionated on 5 - 20? linear sucrose 
20 density gradients in 1 M KaAc pH 8, 3 0 mM -trds- 

acetate pH 8, 1 mM EDTA, and centrifuged overnight 
at 39,000 rpm in an SW50.3 rotor. DNA sedimentation 
vas checked on a parallel gradient loaded with a 
mixture of Hinfl and Pstl digests of pBR322 DNA, and 
25 fractions of this gradient were run on a 1% neutral ■ 
agarose gel. Fractions from the cDNA gradient 
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were diluted vith an equal volume of water and - 
pr cipitat d with 2 v lumes of cold ethanol," and 
then pooled to give thr e final fractions, a large 
cDNA fraction (larger than 2,200 bp), an intermediate 
fraction (1,000 - 2,200 bp) and a fraction containing 
smaller cDNAs (600 - 1,000 bp). cDKA molecules 

smaller than 600 bp were discarded. 

The three final fractions were dissolved to 
approximately 5 ng/ul in 3 50 mi RbCl, 3 0 mM tris- 
HCJ pH 7-6, 0.2 mM EDTA. 

B. An nealing and transformation 
3. Annealing 

dC-tailed cDKA was mixed with dG-tailed pBR322 or pAT3 
in approximately equimolar quantities, at a concentration 
of 0.4 ng/jil of vector. Buffers were as described 
above. The mixtures were heated to 70°C for 30 
minutes and then cooled overnight to room temperature, 
and slowly chilled to 4 °C. Competent cells were 
added and transformed as described below. 

Reparation of competent ce ll s ano transformation 

DH1 cells 91 were grown in 1 0 ml cultures of psi 

broth (2% tryptone, 0.5* yeast extract, 10 mM KaCl, 

20 mM MgCl 2 , pH adjusted to 7-6 with KOH; all 

bacteriological reagents from Difco), and grown 

at 37°C in a shaking waterbath to A , _ =07 

550 w -o« 

3 ml of this was then inoculated into 25 ml of the 
* (recAl,nalA, m^, ndol", R~, relAl ? ) 
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same medium and grown to ■ O.48. The cells 

w re then chilled on dee for 3 5 minutes and harvested 
at .5*000 rpm for 5 minutes in an.'MSE 23 centrifuge 
at 4°C. They were then resuspended in 3 0 ml of 
5 3 00 mM RbC3, 50 mM McC3 2 , 3 0 mM CaCl 2 , 35 wM NaAc 

pH 5-B, 152 glycerol and kept on ice for 3 5 minutes* 



The cells were again harvested, and resuspended 
in 3 ml of 30 mM RbC3 , 75 mM CaC3 2 , 30 mM MOPS - 

10 K0H pH 5.8, 3 5£ glycerol, and kept on ice for a 
further 3 5 minutes. 

3 00 jil of the cells thus prepared were mixed 
with the annealed DNA samples, and incubated on 
ice for 30 minutes, after which they were heat-shocked 

3 5 at 42° C for 90 - 320 seconds. 3 ml of psi broth 
was added, and the cells were grown at 37°C for 
3 'hour. They were then centrifuged" brief ly, and 
resuspended in 3 00 pi of psi broth and plate on 
LB plates containing 1 4 pg/ml tetracycline (LB is 

20 3% tryptone, 0.5% yeast extract, 370 mM 7SaC3, 3.5% * 
agar). 

After 38 - 24 hours growth at 37°C, colonies 
were counted and spotted onto LB plates containing 
33 /ig/ml ampicillin to identify those transf ormants 
25 containing recircularised or uncut plasmids. Over 
3 600 Tet r Amp fi clones wer picked and transferred 



30 



.23- 0145111 

in order d arrays onto large plates of LB containing 
34 ug/ml tetracyclin . • 

C. Screening 

3. Labelling of oligonucleotide 

The ricin B chain specific oligomer (20 mer) 
vas end labelled using polynucleotide kinase. 
500 ng of oligonucleotide was" incubated in 50 mM 
tris pH 8.5, 10 mM MgCl^.i'mM DTI., 0.1 mM spermidine- 
HC3, 0.3 mM EDTA with 60 jiCi * ( 32 p) . ATP and 1 jxl 
polynucleotide kinase (Boehringer) for 35 min at 
37°C. The reaction was stopped by adding an equal 
volume of 0.6 M NH^AC and the bulk of non incorporated 
VATP was removed by passage through a sephadex G25 
column in 0.34 M NaC3, 0.02 M tris pH.7.6, 0.005 M 
3 5 EDTA, 0.3? SDS. The probe was stored frozen 

at -20°C. 

i Colony Hy bridization using oligonucleotide -probe 

4 

Transformants were grown on nitrocellulose filters 
(Schleicher & Schuell 0.45 uj layered over LB plus 
20 tetracycline. The filters, in triplicate, were 
then transferred to LB-Tet plates dontainig 200 
pg/ml chloramphenicol f r 3 6 h at 37°. The filters 
vcre placed colony side up on two sheets of 3 mm 
paper wetted with 0.5 M KaOH for 35 wins at room 
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temperature. Th same procedure was followed for 
th following two washes (l) with 1M tris pH 8.0, 
and (2) with 1 W tris pH 8, 1.5.M NaCl (3Q aiins). 
The filters were air dried and baked at 80°C. 

5 Prehybradizations and hybridizations were done 

in double sealed polythene bags. The filters were 
prehybridized in 0.9 M KaCl, 0.09 M tris 7.4, 0.006 M 
EDTA, 0.5% HP40, 2x Denhardts, 0.2% SDS, 100 ug/ml 
denatured single strand salmon sperm DNA and 70 

3 0 ug/ml tRKA. Prehybridization was done for 4 h 
at 55°C. The prehybridization buffer was then 
squeezed from the bag and fresh buffer added that 
contained 50 ng labelled probe (to give a maximum 
concentration of 5 ng/ml buffer). Annealing was 

3 5 done overnight at 37°C. 

Kon stringent washes were done with '6 x SSC at ' 
room temperature. The filters were washed in A 
changes of 6.x SSC over 3h. The triplicate filters 
were then washed at three different temperatures 

20 determined from the base composition and degree 
of mismatch of the probe. Using 2°C for every 
A or T and 4°C for every C or 6 in the probe the wash tempera- 
tures selected vere 52°C, 56°C and 60°C. The filters 
were washed at the stringent temperature in 6 x " 

25 SSC for 3 0 minutes and then dried thoroughly. 

The filters were exposed to X-ray film overnight. 
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D. Hybrid selection procedure 

3. DKA binding 0145111 



Plasmid DNA was purifi d from the positive clone (s) 
and 10-15 /ig linearized with EcoRl . After phenol: 
5 chloroform extraction and ethanol precipitation 
the pellet was dissolved in 0.5 ml 0.1 x SSC. 
0.5 ml 1 M KaOH was then added and the mixture allowed 
to stand for 15 rains at room temperature. 4 ml 
of a prechilled neutralizing solution (1.5 >J KaCl , 

10 0.25 M HCl, 0.25 M tris- C*.pH 8.0) was added and 

the 5 ml DKA sample sucked by vacuum through swinnies 
containing wetted Schleicher and Schuell 0.45 u 
filter discs. 5 ml 6 x SSC was then passed through 
the filter(s). These were air dried and then baked 

15 for 2 h at 80°C. 

2. Hybrid selection protocol 



The filter (s) were placed in 5 ml bottles and 
prehybridized for .4 h at 43°C in 50% formamide, 
0.4 M NaCl, 10. mM pipes-UaOH pH 6.4, 4 raM EDTA, 

20 0.5 ysM tRKA, 3 0 pg/n.1 poly. (A). The buffer was 

removed and the filter(s) typically hybridized over- 
night at 41°C in 50* forraamide buffer (above) containing 
appro,, 20 ug poly (A)? RNA from castor beans. Tne buffer was removed 
and the filters washed twice for 15 rains in each 

25 of the following: (l) 1 * SSC, 0.5% SDS at room 
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temperature, (2) 0.1 x SSC, 0.156 SDS at room temperature, 
(3) 0.1 x SSC, 0.1% SDS at 50°C, (4) 0.1 x SSC, 0.156 
SDS at room temperature. The filters wer drain d 
and 200 ul hybrid release buffer (9056 formamide, 10 
5 mM pipes-NaOH pH 6.4, 1 mM EDTA, 0. 556 SDS) added to 
each and mixed for 30 mins at 40°C. The buffer was 
removed into a fresh eppendorf and KaCl added to .2 M. 
The released mRKA was precipitated with ethanol, rinsed 
several times in 7056 ethanol, dried and dissolved in 
10 5 |il sterile water. The sample was translated in a 
reticulocyte lysate cell free system and the products 
run out directly on an SDS-polyacrylamide gel or firstly 
immunoprecipitated with appropriate antisera, 

The DKA sequence coding for the above-mentioned 
15 ricin precursor polypeptide of two of the above-mentioned 
clones, referred to herein as pBRd 6 and pBRCL 17 (RCL 
= Ricinus communis lectin) has now been determined 

by a combination of the Sanger dideoxy method (Sanger et 
al, 1977 - Proc. Natl. Acad. Sci . U.S.A. 74, 5463-67) and 
20 the procedure of Maxam and Gilbert (Maxam and Gilbert, 1980 - 
Meth. Enzym. 65 ? 499- 560 ). In order to determine the sequence 
at the ends of each insert the inserts were excised from 
pBR322 with Pst I and ligated into Pst I linearized, 
phosphatased plasmid pUC& (Vierra and Messing, 1982 - 
25 Gene 19, 259-268). E. Coli DHI cells were transformed 
by these recombinant plasmids. These new recombinant 
plasmids ar referred to herein as pRC L 6 and pRC L 17. 
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II is appareOd 4*51 1 1 
the "two inserts contain a region of common sequence 
and that together they represent a total ricin precursor 
sequence. There are no nucleotide differences between 
the overlapping regions of the inserts in pRCL6 and 

pRCL3 7- 

A new recombinant DMA molecule was then constructed that contains 
the complete nucleotide sequence encoding the ricin 

precursor polj'peptide. This was achieved by isolating 
a fragment, 323 base pairs in length, obtained from 
pRCL37 by digestion with the restriction endonuclease 
Sau 963. and ligating this fragment to' a fragment 
^561 base pairs in length isolated after a partial 
digestion of pRCL6 with Sau 961 Ligation was performed 
in 50 mM trisHCl (pH/.4) 10 mM MgCl . 10 mM di thiothreitol . 
£ mM spermidine. 10 mM ATP, 0 - 3 rag /ml BSA with 5 units 
commercial T4 DKA ligase, and incubation proceeded 
overnight at 15°C. After a standard phenol /chl orof orm 
extraction and ethanol precipitation the li gated DKA 
was pelleted, dissolved in a small volume of 3 0 mM 
tris HCl (pH 7.4), lmM EDTA and digested to completion 
with Pst I. The resulting linearised DKA was then 
ligated (as above) with an equal quantity of Pst I 
linearised, phosphatased pl)C8. The new -recombinant DKA molecule 
containing the entire DXA sequence of the ricin pre- 
cursor and referred to as pRCL6l7, was used in 
conventional manner to transform E. coli DHI cells. 
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Th nucl tide s quence of pRCL6l7 is shown 
her inafter. 

This sequence was deduced from the two overlapping 
cDNA inserts in clones pR(L6 andpRCL17 (the limits 
of the DNA inserts in each of these two clones are 
given below) . 

Nucleotide residues are numbered in the 5 1 
to 3 1 direction with the first residue of the codon 
specifying the amino terminal residue of mature ricin 
A chain numbered 1 and the nucleotides on the 5' side 
of residue 1 indicated by negative numbers. The 5' 
terminal sequence does not extend to the 5' end of 
the mRNA whereas the 3 1 terminal sequence shown is 
followed by a poly (dA) tract 27 residues long, thus 
representing the complete sequence of the region. 
The predicted amino acid sequence is given below the 
nucleotide sequence and differences with the published 
amino acid sequence of mature ricin A and B chains 
(Funatsu Gr, Kimura, M and Funatsu, M. Agric.Biol. 
Chem. Vol 43, PP 2221-2224 (1979), and Yoshitake, S., 
Funatsu, G and Funatsu, M - Agric. Biol. Chem. Vol.42, 
PP 1267-1274 (1978) ) are indicated underneath. 
Residues absent from the published amino acid sequence 
are underlined with a dashed line and the position 
of amino acids present in the published sequence but 
absent from the derived sequence presented here are 
indicated by an asterisk. The dashed line beneath the 
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12 amino acid sequence linking the C-terminus of the 
A chain and the N-terminus of the B chain is bracketed. 
Amino acids are numbered from the amino terminal residue 
of the mature A chain and the preceding residues are 
5 indicated by negative numbers. Potential sites for 

asparagine linked N-glycosylation are boxed and potential 
poly (A) signals are underlined. The insert of pRCL6 
extends from nucleotide - 102 to residue 1512 and the 
insert of pRCL17 extends from nucleotide 733 to residue 
10 1782. 

The intervening twelve triplets code for the 
linker amino acid sequence which is present in the pre- 
cursor polypeptide and which is enzymatical ly removed 
in the cell to separate the A and B chains, which chains 

15 are joined by a disulphide bridge during the formation 
of the ricin molecule itself. This linker region as 
well as the presumptive amino terminal leader or signal 
sequence (amino acids - 24 to -l ) are not present in 
the sequences already published by Funatsu et al. 

20 Preproricin is the whole polypeptide coded for 

by the aforesaid DNA insert, i.e. from amino acid -24 
to amino acid 541. Proricin.. which is obtained from 
preproricin in the organism by removal of the amino 
acid leader sequence, extends from amino acid 1 to amino 

25 acid 541. 



* 30 " 0145111 

-100 -50 

5 1 — AAACCGGGAG gaaatactat tgtaatatgg atg tat GCA "GTGGCA aca tgg err 

Met Tyr Ala Val Ala Thr Trp Leu 
-20 

-11. 

TGT TIT GGA TCC ACC TCA GGG TGG TCI TTC ACA TTA GAG GAT AAC AAC ATA 
Cys Fhe Gly Ser Thr Ser Gly Trp 5er Phe Thr Leu Glu Asp Asn Asn lie 

-10 -1 1 

50 ' 

• 

TTC CCC AAA CAA TAC CCA ATT ATA AAC TIT ACC ACA GCG GGT GCC ACT GTG 
Phe Pro Lys Gin Tyr Pro lie lie [Asn Phe Thr \ Thr Ala Gly Ala Thr Val 

10 

100 

♦ 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGT CGT TTA ACA ACT GGA 
Gin Ser Tyr Thr Asn Phe lie Arg Ala Val Arg Gly Arg Leu Thr Thr Gly 
20 30 

150 

GCT GAT GTG AGA CAT GAT ATA CCA GTG TTG CCA AAC AGA GTT GGT TTG CCT 
Ala Asp Val Arg His Asp lie Pro Val Leu Pro Asn Arg Val Glv Leu Pro 
40 Glu 50* 

200 

ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT TCT 
He Asn Gin Arg Phe He Leu Val Qu Leu Ser Asn His Ala Glu Leu Ser 

60 Gin 

250 

• 

GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCI 
Val Thr Leu Ala Leu Asp Val Thr Asn Ala Tyr Val Val Gly Tyr Arg Ala 
70 - Ser 80 

300 

GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 
Gly Asn Ser Ala 3yr Phe Phe His Pro Asp Asn Gin Glu Asp Ala Glu Ala 



90 



100 



350 

ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 

lie Thr His Leu Phe Thr Asp Val Gin Asn Arg Tyr Thr Phe Ala Phe Gly 

110 120 



Page 30 a follows 
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400 



GGT AAT TAT GAT AGA CTT GAA CAA CTT GCT GGT AAT CTG AGA GAA AAT ATC 
Gly Asn Tyr Asp Arg Leu Glu Gin Leu Ala Gly Asn Leu Arc Glu Asn lie 

130 

450 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTI TAT TAT TAP 
Glu Leu Gly Asn Gly Pro Leu Glu Glu Ala lie Ser Ala Leu Tyr Tyr Tyr 
140 150 

500 

AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG. GCT CGT TCC TTT AIA ATT TGC 
Ser Thr Gly Gly Thr Gin Leu Pro Thr Leu Ala Arg Ser Phe lie lie Cvs 

160 " 170 ' 



550 

ATC CAA ATG ATT TCA GAA OCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
lie Gin Met lie Ser Glu Ala Ala Arg Phe Gin Tyr He Glu gTv Glu Met 

180 



600 

S rV f 5 l AC MC CGG AGA rcT °* CCA GAT CCI AQC GTA ATT 

Arg Thr Arg He Arg Tyr Asn Arg Arg Ser Ala Pro Asp Pro Ser Val Ue 

1 " D 200 



650 

ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 

Tnr Uu Glu Asn Ser Trp Gly Arg Leu Ser Thr Ala lie Gin Glu Ser A^n 
210 220 

700 

^ S C S 7 ?? T AGT CCA ATT CAA CTG AGA CGT AAT GGT TCC AAA 
GLn Gly Ala Phe Ala Ser Pro lie Gin Leu Qn Arg Arg jAsn <£■ £ l iff 



Asp 



750 

S £ 2£ S£ ?? if* T A CCT ATC ATA GCT CTC ATG GTG 
rne Ser * a l Tyr Asp Val Ser lie Leu lie Pro He He Ala Leu Met Val 

* Leu 250 

800 

S!!^55 AC pL T 2 A CCA TCG T^A CAG TTT TCT TTG CTT ATA AGG CCA 
Tyr Arg Cys Ala Pro Pro Pro Ser Ser Gin Phe Ser Leu Leu lie Arg Pro 
ZOO ( 

Page 30b follows 
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850 
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GTG GTA CCA AAT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
Val Val Pro Asn Phe Asn Ala Asp Val Cys Met Asp Pro Glu Pro lie Val 
)280 290 



90Q 

CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 
Arg He Val Gly Arg Asn Gly Leu Cys Val Asp Val Arg Asp Gly Arg Phe 

300 Asn 

950 

CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 
His Asn Gly Asn Ala lie Gin Leu Trp Pro Cys Lys Ser Asn Thr Asp Ala 
Asn His 310 320 

1000 

AAT CAG CTc TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
Asn Gin Leu Trp Thr Leu Lys Arg Asp Asn Thr lie Arg Ser Asn Gly Lys 

330 34b 

1050 

TGT TTA ACT ACT TAC GGG TAC AGT CCG GGAGTC TAT GTG ATG ATC TAT GAT 
Cys Leu Thr Thr Tyr Gly Tyr Ser Pro Gly Val Tyr Val Met lie Tyr Asp 

Pro Ser 



1100 

• • . m 

TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 
Cys Asn Thr Ala Ala Thr Asp Ala Thr Arg Trp Gin lie Trp Asp 
360 Thr Asp Glu Asn 

1150 

AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
jAsn Glv Thr [ lie lie Asn Pro Arg Ser Ser Leu Val Leu Ala Ala Thr Ser 

380 390 



1200 

* 

GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 
Gly Asn Ser Gly Thr Thr Leu Thr Val Gin Thr Asn lie Tyr Ala Val Ser 

400 



1250 

CAA GGT TGG CTT CCT ACT AAT AAT ACA CAA CCT TTT GTT ACA ACC ATT GTT 
Gin Gly Trp Leu Pro Thr lftsn Asn Thrj Gin Pro Phe Val Thr Thr lie Val 
Pro Phe Trp 420 



Page 30c follows 
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• • • • • 

GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT ACT GGA CAA GTA TGG ATA GAG 
Gly Leu Tyr Gly Leu Cys Leu Gin Ala Asn Ser Gly Gin Val Trp lie Glu 

430 Val 440 

1350 

GAC TGT AGC ACT GAA AAG GCT GAA CAA CAG TGG OCT CTT TAT GCA GAT GGT 
Asp Cys Ser Ser Glu Lys Ala Glu Gin Gin Trp Ala Leu Tyr Ala Asp Gly 
Ser Cys 450 Ser 

1400 

TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA ACT GAT TCT AAT 
Ser lie Arg Pro Gin Gin Asn Arg Asp Asn Cys Leu Thr Ser Asp Ser Asn 
Asn 460 Asn Arg 470 



1450 

ATA CGG GAA ACA GTT GIT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 
lie Arg Glu Thr Val Val Lys lie Leu Ser Cys Gly Pro Ala Ser Ser Gly 
480 490 

15QP 

CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT ACT GGA 
Gin Arg Trp Met Fhe Lys Asn Asp Gly Thr He Leu Asn Leu Tvr Ser Glv 
Glu 500 

1550 

TTG CTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
Leu Val Leu Asp Val Arg Arg Ser Asp Pro Ser Leu Lys Gin lie lie Leu 
510 Ala 520 

1600 

TAC CCT CTC CAT GCT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT TGA 
Tyr Pro Leu His Gly Asp Pro Asn Gin lie Trp Leu Pro Leu Phe *«■ 
Trp * — *Leu Pro 

1650 

TAGACAGATT ACTCTCTTGC AGTGTGTGTG TCCTGCCAT G AAAATAGATG GCTTAAATAA 
1700 ~~ ~ 



AAAGGACATT GTAAATTTTG TAACTGAAAG GACAGCAACT TATTGCAGTC CAGTATCTAA_ 
J 750 m _ 17 80 

TAAGAGCACA ACTATTGTCT TGTGCATTCT AAATTT-Poly(A) 
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CLAIMS : 

1 . DNA comprising a nucl otide 6 qu nc coding 
for at least a substantial portion of a plant toxin 
of the ricin-type or a mutant thereof, characterised 
in that it is biologically pure and homogeneous. 

2. DNA according to claim 1, characterised in 
that the nucleotide sequence codes for the A chain 
or the B chain of the mature toxin . 

3- DNA according to claim 1, 2 or 3, characterised 

in that the nucleotide sequence codes for a mutant 
in which the galactose binding site or sites have 
been removed or inactivated. 

4* A sample of DNA including 

at least a substantial portion of the following 

nucleotide sequence: 
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ATC TAT GCA GTCGCA ACA TGG CTT 
TGT TTT GGA ICC ACC TCA GGG 1GG TCI TTt ACA TTA GAG GAT AAC AAC ATA 

• • • 

TTC CCC AAA CAA TAC CCA ATT ATA AAC ITT ACC ACA GCG GG1 GCC A ci GIG 

CAA AGC TAC ACA AAC TTT ATC AGA OCT GTT CGC GGT CGT TTA ACA ACT GGA 

•GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTC CCT 

ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAI GCA GAG CTT TCI 

GIT ACA ITA GCC CTG CAI GTC ACC AAT GCA TA1 GIG G1C GGC TAC CGT GCT 

GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAI GCA GAA GCA 

ATC ACT CAI C1T TTC ACT GAT GTT CAA AA1 CGA TAI ACA TTC GCC ITT GGT 

Page 32a follows 
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GGI AAT TAT GAI AGA CTT GAA CAA CTI GCI GGT AA1 CTG AGA GAA AAT ATC 
G*G ITG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 

• • • • • 

AGT ACT GGT GQC ACT CAG CTT CCA ACT CTG GCI CGI ICC TTT ATA ATT TGC 

• • • * m 

ATC CAA ATG ATI TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 

• • • • . 

• CGC ACG AGA ATI AGG TAC AAC CGG AGA TCI GCA CCA GA1 CC1 AGC GTA ATI 

• • • • • 

ACA CTJ GAG A.AI AGT TGG GGG AGA CTI TCC An GCA ATI CAA GAG ICI AAC 

• • . . 

CAA GGA GCC ITI GCI AGT CCA ATT CAA CTG CAA AGA CGI AAT GGT TCC AAA 

TTC AG1 GTG TAC GAI GTG AGT ATA ITA A1C CCI A1C AIA GCT C3C ATG GTG 

1AI AG* TGC GCA CCI CCA CCA ICG TCA CAG I IT TCI TTG CTI ATA AGG CCA 

Page 32b follows... 
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GIG G1A CCA AA7 3T3 AA7 GC7 GAT Gil 7G7 A7G GA7 (CI GAG CCC A7A GIG 



CGI A1C GIA GG7 CGA AAT GG7 CIA 7G7 G]] GA7 GT1 AGG GAT GGA AGA 77C 



CAC AAC GGA AAC GCA ATA CAG TIG TGG CCA TGC AAG 3 CI AAT ACA GA7 GCA 



AA7 CAG CJr TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 



*7G7 T7A AC7 AC7 TAC GGG TAC AG7 CCG GCIA G7C 7A7 G7C» ATG A7C TAT GAT 



TGC AA7 AC7 GC] GCA AC7 GA7 GCC ACC CGC TG3 CAA ATA TGG GAT 



AA7 GGA ACC ATC ATA AAT CCC AGA 7C7 AG] C7A G77 T7A GCA GCG ACA Tf.A 



G35 AAC AG1 GG] ACC ACA C71 ACG G7G CAA ACC AAC ATT TAT GCC GTT AG7 



CAA GG7 TGG CT] CCT AC7 AM ■ AAT ACA CAA CC3 TT7 GT3 ACA ACC ATT GTT 

Page 32c follows.. 
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GGG CIA TA1 GG1 C7C- TGC TTG CAA OCA AAT ACT GGA CAA G1A 7GG AT A GAG 
GAC 7CT AGC ACT GAA AAG OCT GAA CAA CAG 7GG GCT CTT 7 AT GCA GAT GCT 
TCA ATA CGT CC7 CAG CAA AAC CGA GAT AAT 7GC CTT ACA ACT GA1 TCI AAI 

ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC* TCT GGC 

5 • 

CAA CGA 7GG A7G TTC AAG AA7 GA7 GGA ACC ATT TTA AA7 TTG 7 AT AG7 GGA 

TTG GTG T7A GA7 GTG AGG CGA ICG GA7 CCG AGC CTT AAA CAA ATC ATT CTT 

7AC CC7 CTC CAT GCT GAC CCA AAC CAA ATA 7GG TTA CCA TTA TTT 

or at least, a portion of a nucleotide sequence that is 
equivalent thereto by virtue of degeneracy of the genetic 
10 code. 

5. A sample of DNA including at least a substantial portion 
of the following nucleotide sequence: 
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A7C TAT GCA "GIG GCA ACA TGG CTJ 



TGI 1TJ CGk TCC ACC 3CA GGG 1GG 1C1 lit ACA TTA GAG GA1 AAC AAC A1A 



• - . 

TIC CCC AAA CAA 3 AC CCA ATT ATA AAC ITT ACC ACA GCG GGJ GCC A r, G]G 



CAA AGC TAC ACA AAC TTT ATC AGA GCT GT7 CGC GGT CGT TTA ACA AC1 GSA 



• • m 

5 OCT GA1 G3G AGA CAT GAT A1A CCA GIG 1TG CCA AAC AGA GTI GG1 TIG CC3 



A1A AA7 CAA CGG IT! ATI TJA GT7 GAA C1C TCA AA7 CAT GCA GAG CTJ 1C3 

■ 



G3T ACA 31 A GCC CTG GA7 G1C ACC Ml GCA Ik'l GTG G1C GGC TAC CGI (£3 



GA AA1 AGC GCA TA1 TIC 111 CAT CC1 GAC AV) CAC, (,AA GA1 GCA GAA GCA 



A7C AG CAT CH 3TC ACT CA1 GTJ CAA AAT CGA 3 A3 ACA TIC GCC TTJ GGT 

Page 33a follows... 
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• • • • • 

GGl AAT TAT GAT AGA CTT GAA CAA CTT GCT GCT AA7 CTG AGA GAA AAT ATC 

• • • . • • 

GAG TTG GGA AAT GGT CCA CPA GAG GAG GCI ATC ICA GCG CTT TAT TAT TAC 

• • - .... 

ACT ACT GGT GQC ACT CAG CTT CCA ACT CTG GCI CGT TCC TTT ATA ATT TGC 

ATC CAA ATG ATT TCA GAA GCA GCA AGA TIC CAA TAT ATT GAG GGA GAA ATG 
5 CGC ACG AGA ATT AGG T AC AAC CGC AGA 1CT GCA CCA GAT CCT AGC GTA ATT 

4 

ACA CTT GAG AAT ACT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCI AAC 
CAA GGA GCC TTT GCT ACT CCA ATT CAA OG CAA AGA CCT AAT GCT TCC AAA 
TTC ACT GTG TAC GAT CTG ACT ATA T1A ATC CCT A1C ATA GCT CTC ATG CTG 

• * • • 

1A7 AGA GCA CCJ CCA CCA ICG TCAQAG 1TJ , 
10 or at least a portion of a nucleotide sequence vhich is 
equivalent thereto by virtue of degeneracy of the genetic 
code. 
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6. A sample of DNA including at least a substantial portion 

of the following nucleotide sequence: 

* 

GCT GAT GTT 1GT ATG GAT CCT GAG CCC ATA GTG 



CGI ATC GTA GOT CGA AAT GCT CTA TOT GTT GAT GTT AGG GAT GGA AGV TTC 



5 CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 



AAT CAG C3r TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 



TGI TTA ACT ACT TAC GGG TAC ACT CCG GGAGTC TAT GTG ATG ATC TAT GAT 



TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 



AAT GGA ACC ATC ATA AAT CCC AGA TCT ACT CTA GTT TTA GCA GCG ACA TCA 



10 GGG AAC ACT GCT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT ACT 



CAA GCT TGG CTT CCT ACT AAT AAT ACA CAA CCT ITT GTT ACA ACC ATT GTT 

Page 34a follows... 
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GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT ACT GGA CAA CTA 1GG ATA GAG 
GAC TCT AQC ACT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
TCA ATA CCT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA ACT GAT TCT AAT 
ATA CGG GAA ACA CTT GIT AAG ATC CTC TCT TCT GGC CCT GCA TCC TCT GQC 
5 CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT ACT GGA 

• - . „ 

TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 

TAG CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 
or at least a portion of a nucleotide sequence vhich is 
equivalent thereto by virtue of degeneracy of the genetic 
10 code. 

7» A recombinant DNA molecule, characterised in that it 
includes as an insert a nucleotide sequence as defined in 
any preceding claim. 

8, A r combinant DNA molecule according t laim 7, 
*5 characteris d in that said insert has been introduced 
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into a cl ning vector which is a plasmid or a bacterio- 
phage . 

9. A modified host cell containing a recombinant 
DNA molecule according to claim 7 or claim 8. 

10. A modified host cell according to claim 9. 
characterised in that the host is a plant cell, an animal 
cell, a Gram-negative bacterium, a Gram-positive 
bacterium, or a yeast. 

11. A modified host microorganism according to claim 
10, characterised in that the host is a Gram-negative 
bacterium, which is any of E. coli , Methylophilus 
methylotropus , or Alcaligenes eutrophus . 

12. A modified host microorganism according to 
claim 10, characterised in that the host is a Gram- 
positive bacterium which is any of St.reptomyces , 
Bacillus or Arthrobacter . 

13- A modified host microorganism according to 
claim 10, characterised in that the host is a yeast 
which is Saccharomyces cervisiae . 

14* A modified host microorganism according to 
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claim 11, wherein the cloning vector is any suitable 
plasmid sel ct d from pBR322, pAT153> pUC8, pGS15 
or pMB9. 

15« A modified host microorganism according to 
claim 12, wherein the cloning vector is the plasmid 
pUC6 . 

16. A modified host microorganism according to 
claim 13, when the cloning vector is pMA91, pMA230, 
YRp7, PLC544, and YEp6. 

17. Method of preparing a biologically-pure sample 
of a cDNA sequence coding for a precursor ,or a portion 
thereof, of a plant toxin of the ricin type, 
characterised by isolating mRNA from the tissue of 

a plant which produces such a toxin, and synthesising 
cDNA from this by reverse transcription - 

1&. A method of obtaining a recombinant DNA molecule 
by inserting a double-stranded cDNA according to 
claim 17 into a cloning vector. 

19. A method of obtaining a genetically-modified 
host characterised in that a recombinant DNA molecule 
according to claim 18 is introduced into a host micro- 
organism. 
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In response to the Official Communication (EPO Form 1150) of the 
29th August 1984, we now enclose in triplicate re-typed pages 7 7a 
7b, 7c, 8, 8a, 8b, 8c, 30, 30a, 30b, 32, 32a, 32b, 32c, 33 and 34. ' 

The four codons TTA CCA TTA TTT have been added to the end of claim 
4, to bring the claim into exact agreement with the statement on 
original page 7. We submit that it would be obvious to one skilled 
in the art that the DNA sequence of claim 4 should be identical to the 
sequence on original page 7, and also to those portions of the DNA 
sequence given on original page 30 which are expressed as polypeptides 
and that this amendment is therefore allowable under Rule 88. 

We shall be grateful if you will acknowledge receipt of these documents 
using the enclosed copy letter and self-addressed envelope. 
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Description 

This invention relates to DNA comprising a nucleotide sequence coding for at least a portion of a 
polypeptide which is a plant t xin of the ricin type, as hereinafter defined. It also relates to recombinant 
5 DNA molecules c mprising a DNA sequence which codes for a polypeptide which is r is closely related to 
a natural plant toxin of the ricin type. Ricin, and also other plant toxins such as abrin, modeccin and 
viscumin, consist of two polypeptide chains (known as the A and B chains) linked by a disulphide bridge, 
one chain (the A chain) being primarily responsible for the cytotoxic and the other chain (the B chain) 
having sites enabling the molecule to bind to cell surfaces. Ricin is produced in the plant R/c/nis communis 
10 (also known as the castor bean plant) via a precursor protein known as "preproricin". 

Preproricin comprises a single polypeptide chain which includes a leader sequence. The leader 
sequence is subsequently removed in the organism to give proricin which is then cleaved to eliminate a 
linker region and joined by a disulphide bond to form the mature protein. 

The toxicity of ricin-type toxins operates in three phases: (1) binding to the cell surface via the B chain; 
is (2) penetration of at least the A chain into the cytosol, and (3) inhibition of protein synthesis through the A 
chain attacking the 60S subunits of the ribosomes. Thus, separated A and B chains are essentially non- 
toxic, the inherently toxic A chain lacking the ability to bind to cell surfaces in the absence of the B chain. 

It is also known that in ricin-type toxins the B chain binds to cell surfaces by virtue of galactose 
recognition sites, which react with glycoproteins or glycolipids exposed at the cell surface. 
20 It has already been suggested that the toxicity of the ricin A chain might be exploited in anti-tumour 
therapy, by replacing the indiscriminately-binding B chain with a different carrier component having the 
ability to bind only to tumour cells. Thus, various immunotoxins have already been prepared, consisting of 
a conjugate of whole ricin or a separated natural ricin A chain and a tumour-specific monoclonal antibody. 
Although these known conjugates are of considerable potential in themselves, there is scope for 
25 improvement 

One problem with the known conjugates arises from a structural feature of the A chain from natural 
ricin. It is known that the natural ricin A chain becomes N-glycosylated during its synthesis, by enzymes 
present in Ric/nus cells, and it is thought that the resulting sugar moieties are capable of non-specific 
interactions with cell surfaces. Thus, it appears that the known A chain conjugates are capable of a certain 

30 amount of binding with non target cells, even in the absence of the natural B chain, thus increasing the 
toxicity of such immunotoxins towards non target cells. 

Another problem with the known ricin A chain conjugates stems from the fact that the B chain seems to 
have an important secondary function in that it somehow assists in the intoxication process, apart from its 
primary function in binding the ricin molecule to the cell surface. This secondary function is lost if the B 

35 chain is replaced by a different carrier component such as a monoclonal antibody. 

If it were possible to prevent interactions between the cell surface via the A chain sugar moieties, whilst 
preserving the secondary toxicity-increasing function of the B chain, the toxicity of a whole ricin antibody 
conjugate towards normal ceils could be reduced, and towards target cells could be increased, thus 
improving the therapeutic index of the immunotoxin. It is also known that the natural ricin B chain is N- 

40 glycosylated and the B chain sugar moieties may also contribute to non specific interactions. Also, the 
sugar moieties in both chains enable the ricin molecule to be sequestrated by reticuloendothelial cells in 
the liver, and so would lead to the rapid excretion from the system of a drug based on a part or the whole of 
the ricin molecule in which such sugar moieties were still present. 

Attempts to remove all the sugar moieties from natural ricin by chemical or enzymatic methods have 

45 so far failed. Nevertheless the major obstacle confronting the use of known whole ricin-entibody 
conjugates is the presence of two galactose binding sites in the ricin B chain. These B chain galactose 
binding sites are primarily responsible for the non-specific cellular interactions of current whole ricin- 
antibody conjugates, particularly when used in vivo. Their presence in the natural toxin clearly eliminates 
or reduces the targeting specifity conferred by the antibody. 

so An improved immunotoxin based on ricin or another plant toxin of the ricin type, not suffering from 
these problems, could consist of a whole toxin molecule modified so that it is not N-glycosylated, and so 
that the B chain has no galactose recognition sites, but retains its secondary intoxication-promoting 
properties, coupled to a carrier moeity which delivers the toxin to the target cells/This could be a tumour- 
specific or cell/tissue specific vehicle such as a suitable monoclonal antibody. 

55 Our research which has so far been concentrated on ricin itself, has indicated that the assembly of ricin 
(and the related agglutinin which consists of two ricin-like molecules with slightly modified A and B chains) 
does not involve the separate synthesis of the A and B chains as the products of distinct mRNA's, but rather 
the initial formation of a single polypeptide precursor containing both the A chain 8nd B chain sequences. 
This is thought to apply in the case of other toxins of the same type. 

60 This invention is based on the idea of preparing a genetically-engineered microorganism capable of 
xpressing a molecule of a toxin of the ricin typ , as defined above, or alternatively part of such a toxin 
molecule, or e precurs r of such a molecule (which c uld be convert d to the toxin molecul itself) which 
toxin molecule could be modified as suggested above and could be used to construct an effective toxin 
conjugate by combining it with a tumour-specific or cell/tissue specific monocl nal antibody or ther 

65 carrier moiety, such as a hormone or lectin. 
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The fact that ricin is formed via a precursor polypeptide will enable a cell system to be constructed by 
known techniques which expresses a ricin precursor. The ricin precursor product could then be chemically 
or enzymaticaliy converted to the d sired modified ricin. An analag us technique could be used in the case 
of other ricin-type toxins as h r in defined. An alternative technique would be t divide from the DNA 

5 sequence that c des for the precursor tw sequences which code separately fr m the A and B chains, to 
insert these separated sequences into different cloning vehicles and to insert the resulting recombinant 
DNA molecules into separate host microorganisms. One such host would then express the A chain 
polypeptide sequence and the other the B chain polypeptide sequence. These sequences could then be 
combined to form the desired modified ricin molecule. This technique could obviously also be used for any 

10 ricin-type toxins in which the A and B chains are encoded by distinct mRNA gene pools. This latter 
approach would be preferred on safety grounds, in that separate and therefore non-toxic A and B chains 
would be expressed. 

According to one aspect of the invention we provide a biologically pure and homogeneous sample of 
DNA comprising a nucleotide sequence coding for at least a portion of a precursor of a ricin-type toxin 
15 polypeptide, or mutants thereof said portion coding for a polypeptide that displays the ribosome-inactivity 
property of the A chain of ricin. 

Said portion preferably comprises the A chain of the mature protein. 

More specifically, we provide a sample of DNA including at least a substantial portion of any of the 
following DNA sequences, said portion coding for a polypeptide that displays the ribosome-inactivity 
20 property of the A chain of ricin, which sample is biologically-pure: 

ATG tat gca gtg Gca ACA TGG CtT 



TGT TIT GGA TCC ACC TCA 6GG TGG TCT TtC ACA TTA GAG GAT AAC AAC ATA 
TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 
CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 
GCT GAT GTG AGA CAt GAT ATA CCA GTG TIG CCA AAC AGA GTT GGt TTG CCT 
ATA AAC CAA CGG TTT ATT TTA GTf GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTt ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC f AC CGT GCT 
GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAt GCA GAA GCA 
ATC ACT CAT CtT TTC ACT GAT GTT CAA AAT CGA TAT ACA TtC GCC TTT GGt 



GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT AtC 
GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT tAC 
AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC AC6 AGA ATT AGG tAC AAC CGG AGA TCT GCA CCA GAT CCT AGC 6TA ATT 
ACA CtT GAG AAT AGt TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
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CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 

TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCT ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA tCG TCA CAG TTT TCT TTG CTT ATA AGG CCA 
GTG GTA CCA MT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTt GAT GTT AGG GAT GGA AGA TTC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCt AAT ACA GAT GCA 
AAT CAG CtC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CM ATA TGG GAT 
AAT GGA ACC ATC ATA AAT CCt AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
GGG AAC AGT GGT ACC ACA CTT ACG GTG CM ACC AAC ATT TAT GCC GTT AGT 

CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTt ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 

GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 
ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 
CAA CGA TGG ATG TtC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 

ATG TAT GCA GTG GCA ACA TGG CtT 

TGT TIT GGA TCC ACC TCA GGG TGG TCT Tf C ACA TTA GAG GAT AAC AAC ATA 
TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 
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CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 
GCT GAT GTG AGA CAt GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTG CCT 
ATA AAC CAA CGG TTT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCT 
GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 
ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCT GGT AAT CTG AGA GAA AAT ATC 
GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCT ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT 

GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 
AAT CAG CTC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CM ATA TGG GAT 
AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
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GGG AAC AGT GGT ACC ACA CTT ACQ GTG CAA ACC AAC ATT TAT GCC GTT AGT 

CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTf ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 

GAC TGT AGC AGT GAA AAG GCT GAA CM CAG TGG GCT'CTT TAT GCA GAT GGT 
TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 
ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 
CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 
SeiSticrode POrti0n * ' nUC,eotide se 0 uence whi <* «« equivalent thereto by virtue of degeneracy of the 

30 nM A Accord ' n 9 W another aspect of this invention we provide a recombinant DNA molecule comprisino a 
herein eqUenCe C ° deS " p0 ' vpeptide se Q uence present in a plant toxin of the ricin type, as defined 

More specifically, we provide a recombinant DNA molecule containing a DNA sequence which codes 
for a A and B chain precursor polypeptide of a plant toxin of the ricin type. 
as Altematwely, we provide a recombinant DNA molecule containing a DNA sequence coding for at least 
a portion of the A chain of a plant toxin of the ricin type said portion coding for a polypeptide that displays 
the ribosome-mactivity property of the A chain of ricin. * 

nr ~ a t CC0Td, y u° a fu " her aspect of the presant invention w e provide a genetically modified host micro- 
organism which contains a recombinant DNA molecule as hereinbefore defined. 

40 * 1" t o 6 ^f combina " t DNA molecules according to the present invention, the nucleotide sequence coding 
tor the B chain may be modified to eliminate or inactivate the galactose binding sites, end the precursor 
polypeptide and hence in the mature protein, to eliminate or inactivate the galactose binding sites, and the 
ZZhTS'" 9 th f Si9 r a ' S u f0r N -9'VOosylation may also be modified To render them ineffeSvTor to 
eliminate them. Examples of techniques which may prove useful are deletion or oligonucleotide mediated 

45 "TiuIayGneSISi 

The host organism may be a plant cell or an animal cell or preferably a microorganism. 
m«Ji„n O m,C £ 0rflan,Sn !- m *t be 8 prokarvote or 8 eukaryote. As examples of prokaryotes may be 
TZT g ™wW« bacteria, e .g. £ coli, Methylophilus. methylotrophus and Alcaligenes 
eutrophus. and Gram-positive bacteria, e.g. Streptomyces, Bacillus subtilis and Arthrobacter. As examples 

50 of eukaryotes mey be mentioned yeasts, for example Saccharomyces cerevisiae. examples 
•„. u a r8c ° m '? ,n8n . t DNA molecules may comprise a cloning vector such as a plasmid or phage vector 
into wh,ch has been mserted the DNA sequence coding for at least a prortion of a precursor po^ype^d Tor 
at least a portion of either the A chain or the B chain, of a ricin-type plant toxin ypep«ae, or 

The cloning vector is preferably a plasmid although we do not exclude the possibility that it may be a 

55 fr^^fnJ - ^! P f™^,!! 6 8 natural| V-o«*™g Plasmid or preferably a composite derived from 

s£ZS£t£S?L£?l T dS ' Wh6re 8 f° 1 mp0! ? ite p,asmid is em P'0V a d ^ Preferably contains promoter 
sequences which improve expression of the ricin gene. 

ha rtS» m ^bp^, 8 "' 1 ^ 6 ^ 88 "]^^'^ mav De U8ed 88 c,onino vehicles are inter alia for Gram-negative 
bactena: pBR322. PAT153, pUC8, pGSS15 and pMB9; for Gram-positive bacteria: pVC6; end for 

60 %S!r "host^lag^ ' RP? ' ^ The Wi " be 8e,8Cled t0 be ***** 

rin„S?«l! 0 H Pr0 n d J e A a m8th0d of o btainin B 8 recombinant DNA molecule which comprises preparing a 
double-stranded DNA sequence which codes for a polypeptid sequence present in a plant toxin of th ricin 
type and inserting said double-stranded DNA sequence into a cloning vector 
55 More specifically such a method may c mprise isolating the mRNA which codes for the ricin A and B 
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chain precursor polypeptide, synthesising single-stranded cONA from said mRNA using reverse 
transcriptase and a suitable primer, assembling the second DNA strand onto the template formed by said 
first strand by means of DNA polymerase followed by S1 nuclease, and inserting the resulting doubl - 
stranded cDNA into e cloning vector. 
5 Alternatively, the cDNA assembled from the mRNA may be cut into separate porti ns which c de 
respectively for separate portions of the ricin m lecule, f r exampl f r the A and B chains, which portions 
are then inserted into separate cloning vectors. 

As stated above the cloning vector is preferably a plasmid such as pBR322, pAT153, of pUC8, and this 
may be cut open by means of the restriction endonuclease Pst I, and may be tailed with oiigo (dG), and 
10 annealed with the double-stranded cDNA which has been tailed with oiigo (dC). 

We also provide a method of producing a modified transformed host by introducing into a suitable 
host microorganism a recombinant DNA molecule according to this invention. 

The microorganism used as the host for cloning is preferably a Gram-negative bacterium and more 
preferably £ colt. 

is After cloning, the DNA sequence coding for the ricin precursor (or the precursor of another ricin-type 
toxin which is formed from a precusor) may be removed from the host cloning vector. It may then be 
divided into two portions which code for separate regions of the toxin molecule, for example the A and B 
chains, these portions introduced into separate second cloning vectors and new hosts modified with each 
of the resulting new recombinant DNA molecules. Alternatively it may be introduced whole into a second 

20 cloning vector. The second cloning vectors comprise suitable promoter sequences and the position and 
direction of insertion of the whole coding sequence or portion thereof into the second cloning vector are 
such that on introducing the new recombinant DNA molecules into suitable host microorganisms, e.g. f. 
coli or Sxerevistae, expression of the desired gene sequence is obtained. 

The preparation of a transformed host containing a DNA sequence which codes for the ricin A and B 

25 chain precursor polypeptide will now be described by way of example, first in general terms and then in 
detail. This process is summarised in the accompanying diagram. 

Firstly, the mRNA encoding this precursor was enriched in known manner by sucrose density gradient 
centrifugation. The corresponding cDNA was assembled in single-strand form onto this mRNA in known 
manner using the enzyme reverse transcriptase, a growing point having first been provided on the mRNA 

30 using oiigo (dT) as primer which binds onto the polyadenylated 3'-terminus of the mRNA. The immediate 
product of this reaction is a DNA — RNA hybrid. The RNA strand is removed by hydrolysis, leaving the 
single-strand DNA intact. This is converted to the double stranded form using the enzyme DNA polymerase 
in the presence of free nucleotides, which results in a hairpin-shaped molecule, the curved end of which is 
then removed by the single-strand specific nuclease S1. The resulting double strand cDNA is then tailed 

35 with oiigo <dC) using terminal transferase, size fractionated to remove small molecules, or vice versa and 
annealed with the pBR322 or pAT153 vector which has beien cut open with Pst I and tailed with oiigo <dG) 
using terminal transferase, the cytosine tails on the DNA base pairing with the guanine tails on the vector. 

The resulting chimaeric plasmids containing the DNA segment coding for the ricin precusor 
polypeptide were then used to transform E<oli DH1 cells, and the presence of the chimaeric plasmid was 

40 ensured by selecting cells displaying tetracycline resistance and ampicillin sensitivity. Over 1600 Tet r , Amp* 
clones were obtained. Colonies derived from each clone were transferred to nitrocellulose filters and 
clones containing the desired DNA sequence identified using a 32P-end labelled 20 mer oligonucleotide 
probe, having the DNA sequence ACCTACAAfcTT?TTSCT£CC which hybridises to DNA containing the 
complementary sequence TGGATGTT^AAjAAjGAjGG. As the ricin precursor polypeptide have been 

45 found to contain the amino sequence — Trp-Met-Phe-Lys-Asn-Asp-Gly- DNA sequence responsible for this 
is known from the genetic code to be the latter mentioned above. 

Using appropriate hybridisation and wash conditions, e.g. as described by Singer-Sam et at in (1983) 
Proc. Natl. Acad. Sci. (U.S.A.), Vol. 80 pp 802—806, 80 clones were selected as positively containing the 
desired DNA sequence, and of these, the eight largest in the plasmid pBR322 have been initially chosen for 

so further characterisation. Their relationship to the castor bean lectin precursor polypeptides has been 
confirmed using the hybrid release translation assay. Of the eight clones mentioned above, four 
respectively with 1614, 1960, 1059 and 1020 base pairs, have been selected for sequencing. 
In detail, the transformed host was prepared as follows: 

55 A. cDNA synthesis 

1 . mRNA extraction and fractionation 

100—200 g of ripening Ricinus seeds were frozen and ground to a powder in liquid nitrogen, and 
homogenised in a Waring blender for 1—2 minutes in 50 mM tris-HCI pH 9, 150 mM NaCI, 5 mM EDTA and 
5% SDS. The homogenate was extracted with an equal volume of phenol xhloroform (1:1) and the phases 

60 were separated by centrifugation. The organic phase and residu were reextracted with 0.5 volum of 20 
mM tris-HCI pH 9.0, 2 mM EDTA and the resultant aqueous phase was combined with th original one. The 
total aqueous phase was reextracted repeatedly with equal volumes of phen l:chl r form until n material 
was present at the interfac . RNA was precipitated by the addition of 2 volumes of cold ethanol after 
making th solution 200 mM in NaCI. 

6S 
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After overnight precipitation at -20°C the RNA was centrifuged at 10,000 rpm for 30 minutes in an MSE 
18 or MSE 21 centrifuge; the pellet was then washed repeatedly in 3 M NaAc pH 5.5 until no polysaccharide 
was detectable in the supernatant by ethanol precipitation. The final pellet was dissoved in 300 mM NaCI 
and precipitated as above. 

5 mRNA molecules bearing poly(A) tails were extracted by affinity chromatography on olig (dT)- 
ceiluiose: after hybridisation at room temperature for 30 min. in 400 mM NaCI, 20 mM tris-HCI pH 7.6, 0 2% 
SDS, the beads were pelleted and washed three times in the above buffer and two times in 200 mM NaCI, 
20 mM tris-HCI pH 7.6, 0.1% SDS. The slurry was poured into a column and washed further with the last 
buffer until the A,*, of the eluate reached the background level. Poly(A)-containing RNA was then eluted 

io with 20 mM tris-HCI pH 7.6 at 50°C. The eluate was monitored with an ISCO continuous flow UV cell 
Poly(A)-containing RNA was precipitated overnight from 200 mM NaCI by the addition of 2 volumes of cold 
ethanol at -20°C and was then washed three times with 70% ethanol, and redissolved in 10 mM tris-HCI dH 
7.0 to approximately 1 u,g/uj. 

The mRNA was heated for 2 min. at 65°C and quench cooled. Approximately 400 pg of poly(A)* RNA 

ts was layered on top of a 10—30% ribonuclease-free sucrose (Sigma) density gradient in 100 mM tris-HCI pH 
7.5, 0.5% SDS, 1 mM EDTA, and centrifuged in a Beckman L5— 65B centrifuge using an SW27 rotor at 
25,000 rpm at 17°C for 14 hours. 400 Ml fractions were collected with an ISCO density gradient fractionator 
using the continuous flow UV cell. 

Each fraction was made 200 mM in NaCI and precipitated with 2 volumes of cold ethanol by freeze- 

20 thawing in liquid nitrogen three times, and recovered by centrifugation in an Eppendorf microcentrifuge for 
30 minutes at 4°C, washed once with 70% ethanol, and redissolved in 10 ul of 10 mM tris-HCI pH 7 0 An 
aliquot from each fraction (1 uJ) was translated in a reticulocyte lysate cell free system and the lectin 
precursor immunoprecipitated to identify the fraction enriched for lectin mRNA. 

25 2. -First strand synthesis 

« * ^ rti ?- n !! e l po,y(Ar RNA was reverse transcribed at 0.5 ug/uJ in the presence of 50 mM tris-HCI pH 
?A ' \°, ^MMQ^ 2 ' 100 mM KCI ' 1 mM of dATP ' dTTP and dGTP ' 250 ^ dCTP, 0.06 ng/uJ oligo(dT) 12 

an ? °; 4 unlts/Ml of reverse transcriptase from avian myeloblastosis virus. (3H)dCTP or a- 
("P)dCTP were included in the reaction as appropriate. 

30 2r^°s m i^^?A?™ at 42 ° c for 45 minutes ' at which p ° int an e * uai v ° iume ° f 5 mM 

tris-HCI pH 8.3, 5 mM DTT, 250 u.M dCTP was added along with the same amount of enzyme as previously 
The reaction was incubated for a further 45 minutes at 45°C and terminated by freezing. Aliquots were 
analysed on 1% denaturing agarose gels along with the products of the second strand and S, nuclease 
reactions. 



35 
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3. Second strand synthesis 
mRNA^-cDNA hybrids were denatured by boiling the first strand reaction for 3 minutes and cooling 

rapidly. After pelleting insoluble material in the Eppendorf microfuge for 2 minutes the supernatant was 
transferred to a fresh chilled tube. For the standard reaction, reagents were added as follows, ignoring 
Hr^ e ?lhf. re J ? Y present: . dATP < dGT ? and dTTPto 100 pM, Hepes-KOH pH 6.9 to 105 mM, KCI to 92 mM 
™Jh a ! £L?! appr ° pnate ' t0 *? »M< and 0.1 units/ul of DNA polymerase. The reaction was allowed to 
proceed at 20T for 6 hours, at which time cDNA was removed from the mixture by gel filtration on 1 ml 
columns of B.o-Gel P60 in 10 mM tris-HCI pH 7.6, 20 mM NaCI, 1 mM EDTA. Fractions were monitored by 

45 ^ w^^r nH^I^ th* >** n C °,o ntln , 9 ' a " d F* exduded fractlons were poo,ed and Palpitated from 
45 0.3 M NaAc pH 6 by the addition of 2 volumes of cold ethanol. Precipitates were recovered by centrifugation 

RNA - eqSiva^ ^ 3 ° minutes in the cold ' and disso,v ed »" water to about 2.5 ng/uJ of 

4. Si nuclease digestion 

so Single-stranded regions of double-stranded cDNA were digested with S, nuclease from Asperaif/us 

>™ e ™°>! 3 °? m X NaCI ' 30 mM o^ aAc pH 4 ' 5 ' 3 mM ZnCL " Tha reac ^n was incubated Z 
15 minutes at 37"C : and ^then for 15 minutes at 15'C, and was terminated by the addition of tris-HCI pH 7.6 to 

J^iSFJSf ? m '* rt ^ aS tHen extracted with an volume of phenol:chloroform:isoamyl 
alcohol (25:24: ) and precipitated from 300 mM NaAc pH 6 with 2 volumes of cold ethanol. The precipitate 
55 was dissolved in 10 mM tris-HCI pH 8, 0.1 mM EDTA to 0.25 ug/ul RNA equivalent. 

5. Addition of homopolymer tails to DNA 

Double-stranded DNA was tailed using terminal transferase with dCTP at 0.001—0.01 ug/ul in the 
™ Km 6 " 0 ? fl! 4 ^ ™ P ? ta f ! lum cacod V' a te PH 7.6, 30 mM tris base, 0.1 mM DTT, 1 mM CoCI, and (>H) or 
60 ( P) labelled dCTP in 7&-150 fold excess over 3' termini. The reaction was carried out at 37°C for 6 
mmut s. The extent of incorporation of label was followed by assaying the amount f TCA - insoluble 
radioactivity as a proportion of the total radioactivity, counting in Bray's scintillant. 

The reaction was stopped by chilling and adding EDTA to 10 mM, after which unicorporated material 

« wfi re u° 0 Ve , A Y & el f,,trati0n as described - Tai,ed cD NA was precipitated as before, and dissolved In 1 M 
55 NaAc pH 8. 10 mM tns-acetate pH 8, 1 mM EDTA ready for fractionation 
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Pstl-cleaved pBR322 ON A was similarly treated, except thet dGTP replaced dCTP. 

6. Frecti nati n of tailed cDNA 

cDNA was fractionated on 5-20% linear sucrose density gradients in 1 M NaAc pH 8, 10 mM wis- 
acetate pH 8, 1 mM EDTA, and centrifuged overnight at 39.000 rpm in an SW50.1 rotor. DNA s dimentation 
was checked on a parallel gradient loaded with a mixture of Hinfl and Pstl digests of pBR322 DNA and 

SXi^inoS,?,?^ 1 were / un , on 3 ] % neutr81 8 9 arose 8 el - Fra «ions ^m the cDNA gradient were 
diluted with an equal volume of water and precipitated with 2 volumes of cold ethanol, and then pooled to 

rM5Sn!Eil!F TV ' 8r9e ° DNA Ua ? i0n (,ar96r tha " 2 ' 200 bp) ' an intermediate fraction 
600^""re dfecardSd containing smaller cDNAs (600-1.000 bp). cDNA molecules smaller than 

7.6. 0.2 e mM e EDTA ffaCti ° nS disSOlved t0 a PP«*imetely 5 ng/ul in 150 mM RbCI, 10 mM tris-HCI pH 

is B. Annealing and transformation 
1. Annealing 

a JSSSS£ With d o' ta i' ed PBR322 0r pAT153 in ^Proximately equimolar quantities, at 
fef 30 mim a ,« ",nn t'h 9/11 ^"or. Buffers were as described above. The mixtures were heated to 70°C 
and the " cp0 ' ed overn.ght to room temperature, and slowly chilled to 4»C. Competent cells 
were added and transformed as described below. *«»«>»» 
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2. Preparation of competent cells and transformation 

^ m «« HI C « cl (recAI ' nalA ' r >' m; ' endor ' R "< relAI7 > were erown in 10 ml cultures of psi broth (2% 
22 S 10 mM N «2r 20 mM MoCI * pH ad i ustad to 7.6 with KOH; all £SteS B ieal 

i3a?ed Z lilT 8t H 37 ° in i 8hakins J*" 1 "* t0 *~ m °' 3 - 1 ml of tnis was *2 
5 minutes and ^.^^ « ?n^" d,UI S 9r0wn t0 = 0M - 7,18 ce,,s were "en chilled on ice for 
15 minutes and harvested at 5,000 rpm for 5 minutes in an MSE 21 centrifuge at 4 4 C. Thev were then 

3n JimJlc °L^f Ce I! S i h . U ,f pre P flred were mixed w ith the annealed DNA samples, and Incubated on ice for 
?2rfT 8 « aft6r whlCh were neat-shocked at 42»C for 90-120 seconds. 1 ml of psi broth was added 
ZSAAZTi B t r ° Wn , p T C f ° r 1 h0Ur ^ Were then centrifuged briefly, and resuspende^ ?n 100 Ji 
W^MauSSTiSrt 8 P 00 " 9 U M/m ' tetracyc,ina (LB ia 1% M% veast extract 

„. Aftar 18-24 hours growth at 37 e C, colonies were counted and spotted onto LB plates containino 33 unl 

W clones we'rt'n"^ 08 : V n f m T B racirculariaad or uncut K5K5?Ww 

ttJacyS^e P ^ ,rans,erred in ordered a ™V a onto large plates of LB containing 14 Jg/ml 

C. Screening 

1. Labelling of oligonucleotide 
The ricin B chain specific oligomer (20 mer) was end labelled using polynucleotide kinase 500 no of 

^S^TSa^S?^ PH . 8 - 5 ' 1 .° mM MflCI * 5 m " ^ W "•MtiSSnSnfflS 

»«^« - 1 u Y< 52 A and 1 Ml Polynucleotide kinase (Boehringer) for 35 min at 37"C The 
reaction was stopped by adding an equal volume of 0.6 M NH.AC and the bulk of non incorporated vAtI 

EDTA 01% IS P Th? a ^ h thrOUOn 3 S !, P , nadeX G25 C °' Umn in 014 M NaCI ' «■* M "risX 6 . 0 005 M 
feOTA, 0.1% SDS. The probe was stored frozen at -20T. 

2. Colony Hybridization using oligonucleotide probe 

Transformants were grown on nitrocellulose filters (Schleicher & Schuell 0.45 u> layered over LB plus 
tetracycline. The filters, in triplicate, were then transferred to LB-Tet plates containing 200 uo/ml 

SSKSTJS !° f « " at 37 °- The filters were placed colon y side up °n ^o sn eets of 3 mm paper wetted 
55 with 0.5 M NaOH for 15 mms at room temperature. The same procedure was followed for the following two 
washes (1) with 1 M tns pH 8.0, and (2) with 1 M tris pH 8, 1.5 M NaCI (30 mins). The filters were air dried 
ana baked at 8(rC. 

Prehybridizations and hybridizations were done in double seated polythene bags. The filters were 
prehybndlzed in 0.9 M NaCI, 0.09 M tris 7.4, 0.006 M EDTA, 0.5% NP40, 2 x Denhardts, 0.2% SDS. 100 ug/ml 
so denatured single strand salmon sperm DNA and 70 ug/ml tRNA. Prehybridization was done for 4 h at 55°C 
The prehybridation buffer was then squeezed from the bag and fresh buffer add d that contained 50 no 
labeHed probe (to give a maximum concentration of 5 ng/ml buffer). Annealing was done overnight at 37°C 
Non stringent washes were done with 6 x SSC at room temperature. The filters were washed in 4 
changes of 6 x SSC over 3 h. The triplicate filters were then washed at three different temperatures 
65 determined from the base c mpositi n and degree of mismatch of the prob . Using 2°C for every A or T 
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and 4°C for every C or G in the probe the wash temperatures selected were 52°C, 56°C and 60°C. The filters 
were washed at the stringent temperature in 6 x SSC for 10 minutes and then dried thoroughly. The filters 
were exposed to X-ray film overnight. 

5 D. Hybrid selection procedure 

1. DNA binding 

Plasmid DNA was purified from the positive cione(s) and 10—15 ug linearized with EcoRI. After 
phenol :chloroform extraction and ethanol precipitation the pellet was dissolved in 0.5 ml 0.1 x SSC. 0.5 ml 
1 M NaOH was then added and the mixture allowed to stand for 15 mins at room temperature. 4 ml of a 
io prechilled neutralizing solution {1.5 M NaCI, 0.25 M HCI, 0.25 M tris-(ApH 8.0) was added and the 5 ml DNA 
sample sucked by vacuum through swinnies containing wetted Schleicher and Schuell 0.45 \x filter discs. 5 
ml 6 x SSC was then passed through the filter(s). These were air dried and then baked for 2 h at 80°C. 

2. Hybrid selection protocol 

is The filter(s) were placed in 5 ml bottles and prehybridized for 4 h at 41°C in 50% formarnide, 0.4 M 
NaCI, 10 mM pipes-NaOH pH 6.4, 4 mM EDTA, 0.5 ug/ml tRNA, 10 ug/ml poly (A). The buffer was removed 
and the filter(s) typically hybridized overnight at 41°C in 50% formarnide buffer (above) containing approx. 
20 ug poly (A)* RNA from castor beans. The buffer was removed and the filters washed twice for 15 mins in 
each of the following: (1) 1 x SSC, 0.5% SDS at room temperature, (2) 0.1 x SSC, 0.1% SDS at room 
20 temperature, (3) 0.1 x SSC, 0.1 % SDS at 50°C, (4) 0.1 x SSC, 0.1% SDS at room temperature. The filters 
were drained and 200 pi hybrid release buffer (90% formarnide, 10 mM pipes-NaOH pH 6.4, 1 mM EDTA, 
0.5% SDS) added to each and mixed for 30 mins at 40°C. The buffer was removed into a fresh eppendorf 
and NaCI added to .2 M. The released mRNA was precipitated with ethanol, rinsed several times in 70% 
lethanol, dried and dissolved in 5 pi sterile water. The sample was translated in a reticulocyte lysate cell free 
system and the products run out directly on an SDS-polyacrylamide gel or firstly immunoprecipitated with 
appropriate antisera. 

The DNA sequence coding for the above-mentioned ricin precursor polypeptide of two of the above- 
mentioned clones, referred to herein as pBRCL 6 and pBRCL 17 (RCL « Ricinus communis lectin) has now 
been determined by a combination of the Sanger dideoxy method (Sanger et al, 1977 — Proc. Natl. Acad. 
Sci. U.S.A. 74, 5463—67) and the procedure of Maxam and Gilbert (Maxam and Gilbert, 1980 — Math. 
Enzym. €5, 499—560). In order to determine the sequence at the ends of each Insert, the inserts were 
excised from pBR322 with Pst I and ligated into Pst I linearized, phosphatased plasmid pUC8 (Vierra and 
Messing, 1982— Gene 19, 25^-268). E. Coll DHI cells were transformed by these recombinant plasmids. 
These new recombinant plasmids are referred to herein as pRCL6 and pRCL17. 
35 It is apparent that the two inserts contain a region of common sequence and that together they 
represent a total ricin precusor sequence. There are no nucleotide differences between the overlapping 
regions of the inserts in pRCL6 and pRCL17. 

A new recombinant DNA molecule was then constructed that contains the complete nucleotide 
sequence encoding the ricin precursor polypeptide. This was achieved by isolating a fragment, 323 base 
40 pairs in length, obtained from pRCL17 by digestion with the restriction endonuclease Sau 961, and ligating 
this fragment to a fragment 1561 base pairs in length isolated after a partial digestion of pRCL6 with Sau 
961. Ligation was performed in 50 mM tris HCI (pH 7.4) 10 mM MgCI 2 , 10 mM dithiothreitol, 1 mM 
spermidine, 10 mM ATP, 0.1 mg/ml BSA with 5 units commercial T4 DNA ligase, and incubation proceeded 
overnight at 15°C. After a standard phenol/chloroform extraction and ethanol precipitation the ligated DNA 
45 was pelleted, dissolved in a small volume of 10 mM tris HCI (pH 7.4), 1 mM EDTA and digested to 
completion with Pst I. The resulting linearised DNA was then ligated (as above) with an equal quantity of 
Pst I linearised, phosphatased pUC8. The new recombinant DNA molecule containing the entire DNA 
sequence of the ricin precusor and referred to as pRCL617, was used in conventional manner to transform 
£ coii DHI cells. 

so The nucleotide sequence of pRCL617 is shown hereinafter. 

This sequence was deduced from the two overlapping cDNA inserts in clones pRCL6 and pRCL17 (the 
limits of the DNA inserts in each of these two clones are given below). 

Nucleotide residues are numbered in the 5' to 3' direction with the first residue of the codon specifying 
the ammo terminal residue of mature ricin A chain numbered 1 and the nucleotides on the 5' side of residue 

55 1 indicated by negative numbers. The 5' terminal sequence does not extend to the 5' end of the mRNA 
whereas the 3* terminal sequence shown is followed by a poly (dA) tract 27 residues long, thus 
representing the complete sequence of the region. The predicted amino acid sequence is given below the 
nucleotide sequence and differences with the published amino acid sequence of mature ricin A and B 
chains (Funatsu G., Kimura, M. and Funatsu, M. Agric. Biol. Chem. Vol 43, pp 2221—2224 (1979), and 

60 Yoshitake, S., Funatsu, G. and Funatsu, M. — Agric. Biol. Chem. Vol. 42, pp 1267—1274 (1978)) are indicated 
und meath. Residues absent from the published amino acid sequence are underlined with a dashed line 
and the position of amino acids present in the published sequence but absent from the derived sequence 
pres nted here are indicated by an asterisk. The dashed line beneath the 12 amino acid sequence linking 
the C-terminus of the A chain and the N-terminus of the B chain is bracketed Amino acids are numbered 

65 from the amino terminal residue of th mature A chain and the preceding residu s are indicated by 
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negative numbers. Potential sites for asparagine linked N-glycosylation are boxed and potential poly (A) 
signals are underlined. The insert of pRCL6 extends from nucleotide — 102 to residue 1512 and the insert of 
pRCL17 extends from nucleotide 733 to residue 1782. 

The intervening twelve triplets code for the linker amino acid sequence which is present in the 
precursor polypeptide and which is enzymatically removed in the cell to separate the A and B chains, which 
chains are joined by a disulphide bridge during the formation of the ricin molecule itself. This linker region 
as well as the presumptive amino terminal leader or signal sequence (amino acids -24 to -1) are not 
present in the sequences already publised by Funatsu et el. 

Preproricin is the whole polypeptide coded for the aforesaid DNA insert, i.e. from amino acid -24 to 
amino acid 541. Proricin, which is obtained from preproricin in the organism by removal of the amino acid 
leader sequence, extends from amino acid 1 to amino acid 541. 

-100 -50 
5'-AAACCGGGAG GAAATACTAT TGTAATATGG ATG TAT GCA GTG GCA ACA TGG CTT 

Met Tyr Ala Val Ala Thr Trp Leu 
-20 



TGT TTT GGA TCC ACC TCA GGG TGG TCT TTC ACA TTA GAG GAT AAC AAC ATA 
Cys Phe Gly Ser Thr Ser Gly Trp Ser Phe Thr Leu Glu Asp Asn Asn lie 

-10 -1 1 



TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GOT GCC ACT GTG 
Phe Pro Lys Gin Tyr Pro lie lie I Asn Phe Thr) ' Thr Ala Gly Ala Thr Val 

10 



CAA AGC TAC ACA AAC TTT 
Gin Ser Tyr Thr Asn Phe 
20 



100 

ATC AGA GCT GTT CGC GGT CGT TTA ACA ACT GGA 
lie Arg Ala Val Arg Gly Arg Leu Thr Thr Gly 

30 



?, CT ?* T S 1 ? 3 AGA CAt GAT ATA CCA GTG TO CCA AAC AGA GTT GGT TTG CCT 
Ala Asp Val Arg His Asp lie Pro Val Leu Pro Asn Arg Val Gly Leu Pro 
40 Glu 50 



200 

ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT TCT 
He Asn Gin Arg Phe lie Leu Val Glu Leu Ser Asn His Ala Glu Leu Ser 

60 Gin 



250 

GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC TAC CGT GCT 
Val Thr Leu Ala Leu Asp Val Thr Asn Ala Tyr Val Val Gly Tyr Aro Ala 
70 Ser 80 



?t A tfJ AGC 2* TAT U C 777 ^ CCT GAC ^ GAA SSr GCA GAA GCA 
Gly Asn Ser Ala tyr Phe Phe His Pro Asp Asn Gin Glu Asp Ala Glu Ala 

90 100 



ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 
He Thr His Leu Phe Thr Asp Val Gin Asn Arg Tyr Thr Phe Ala Phe Gly 

110 120 
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400 

GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT AtC 
Gly Asn Tyr Asp Arg Leu Glu Gin Leu Ala Gly Asn Leu Arg Glu Asn lie 

130 



450 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
Glu Leu Gly Asn Gly Pro Leu Glu Glu Ala lie Ser Ala Leu Tyr Tyr Tyr 
140 150 



500 

AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 

Ser Thr Gly Gly Thr Gin Leu Pro Thr Leu Ala Arg Ser Phe He lie Cys 

160 170 



550 

ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
lie Gin Met lie Ser Glu Ala Ala Arg Phe Gin Tyr lie Glu Gly Glu Met 

180 



600 

CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
Arg Thr Arg He Arg Tyr Asn Arg Arg Ser Ala Pro Asp Pro Ser Val He 
190 200 



65 0 

ACA CTT GAG AAT AGt TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 

Thr Leu Glu Asn Ser Trp Gly Arg Leu Ser Thr Ala He Gin Glu Ser Asn 
210 220 



700 

CAA GGA GCC TTT GCT AGT CCA ATf CAA CTG CAA AGA CGT AAT GGT TCC AAA 
Gin Gly Ala Phe Ala Ser Pro He Gin Leu Gin Arg Arg | Asn Gly SeT| Lys 

230 — Asp 



750 

TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCt ATC ATA GCT CTC ATG GTG 
Phe Ser VaJ Tyr Asp Val Ser He Leu He Pro He lie Ala Leu Met Val 
240 Leu 250 



800 

TAT AGA TGC GCA CCT CCA CCA tCG TCA CAG TTT TCT TTG CTt ATA AGG CCA 

Tyr Arg Cys Ala Pro Pro Pro Ser Ser Gin Phe Ser Leu Leu He Arg Pro 

260 ( * ... 



GTG GTA CCA AAT TTT AAT GCt GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
Val Val Pro Asn Phe Asn Ala Asp Val Cys Met Asp Pro Glu Pro lie Val 
) 280 290 



CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 
Arg lie Val Gly Arg Asn Gly Leu Cys Val Asp Val Arg Asp Gly Arg Ph 

300 Asn 
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950 

CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCf AAT ACA GAT GCA 
His Asn Gly Asn Ala lie Gin Leu Trp Pro Cys Lys Ser Asn Thr Asp Ala • 
Asn His 310 320 



1000 

AAT CAG CtC TGG ACT TTfi AAA AGA GAC AAT ACT ATT C(jA TCT AAT GGA AAG 

Asn Gin Leu Trp Thr Leu Lys Arg Asp Asn Thr lie Arg Ser Asn Gly Lys 

330 340 



1050 

TGT TTA ACT ACT TAC G<$G TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 
Cys Leu Thr Thr Tyr Gly Tyr Ser Pro Gly Val Tyr Val Met lie Tyr Asp 

Pro Ser 



1100 

TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 
Cys Asn Thr Ala Ala Thr Asp Ala Thr Arg Trp Gin lie Trp Asp 
360 Thr Asp — Glu Asn 



AAT GGA ACC AtC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
[Asn Gly Thr) lie lie Asn Pro Arg Ser Ser Leu Val Leu Ala Ala Thr Ser 

380 390 



1200 

GGG AAC AGT GGT ACC ACA CtT ACG GTG CAA ACC AAC ATT TAT GCC GTT A6T 
Gly Asn Ser Gly Thr Thr Leu Thr Val Gin Thr Asn lie Tyr Ala Val Ser 

400 



1250 

CAA GGT TG6 CTT CCT ACT AAT AAT ACA CAA CCT TTT GTT ACA ACC ATT 6TT 

Gin Gly Trp Leu Pro Thr I Asn Asn fhTl Gin Pro Phe Val Thr Thr lie Val 

Pro Phe Trp 420 



1300 

GGG CTA TAT GGT CTG TGt TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 
Gly Leu Tyr Gly Leu Cys Leu Gin Aia Asn Ser Gly Gin Val Trp lie Glu 

430 Val 440 



1350 

GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
Asp Cys Ser Ser Glu Lys Ala Glu Gin Gin Trp Ala Leu Tyr Ala Asp Gly 
Ser Cys 450 Ser 



1400 

TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTt ACA AGT GAT TCT AAT 

Ser He Arg Pro Gin Gin Asn Arg Asp Asn Cys Leu Thr Ser Asp Ser Asn 

Asn 460 Asn Arg 470 



1450 

ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GfiC CCT GCA TCC TCT GGC 
He Arg Glu Thr Val Val Lys He Leu Ser Cys Gly Pr Ala Ser Ser Gly 
480 490 
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1500 

CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
Gin Arg Trp Met Phe Lys Asn Asp Gly Thr He Leu Asn Leu Tyr S r Gly 
Glu 500 



1550 

TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
Leu Val Leu Asp Val Arg Arg Ser Asp Pro Ser Leu Lys Gin He He Leu 
510 Ale 520 



1600 

TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT TGA 

Tyr Pro Leu His Gly Asp Pro Asn Gin He Trp Leu Pro Leu Phe 

Tr P * *Leu Pro 

. 1650 
TAGACAGATT ACTCTCTTGC AGTGTGTGTG TCCTGCCATG AAAATAGATG GCTT AAATAA 

1700 

AAAGGACATT GTAAATTTTG TAACTGAAAG GACAGCAAGT TATTGCAGTC CAGtATCTA^ 

1750 1780 
lAASAGCACA ACTATTGTCT TGTGCATTCT AAATTT-Poly(A) 

Claims 

1. A recombinant DNA molecule characterised in that it includes es an insert DNA coding for the A 
chain of nan and having the following nucleotide sequence or a portion thereof coding for a polypeptide 
that displays the ribosome-inactivity property of the A-chain of ricin: 

ATA 

TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGT CGT TIA ACA ACT GGA 

GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTG CCT 

ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT TCT 

GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCT 

GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 

ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TtC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
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AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 

ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 

CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 

ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 

CAA GGA GCC TTT GCT AGT CCA ATf CAA CTG CAA AGA CGT AAT GGT TCC AAA 

TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCT ATC ATA GCT CTC ATG GTG 

TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT 
^^"tteS!^ 8 S8quence Which is et ' uivalent 10 "id sequence or portion by virtue of the degeneracy of the 
„ recombinant DNA molecule according to Claim 1 characterised in that it includes as in insert DNA 

« fiS^KS 1 ?- h Vm9 ^ ^ ,OWin9 nUC,e ° tide sequence or 8 P ortion thereof «*n B for a protein 
25 that displays the cell binding and nbosome-inactivity property of the ricin: 

ATA 

so TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGT CGT TIA ACA ACT GGA 

GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA MC AGA GTT GGT TTG CCT 

ATA AAC CAA CGG TTT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT TCT 

GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCT 

<s GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 

ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 
so GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATt TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
65 ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 

15 
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CAA GGA GCC TTT GCT AGT CCA ATt CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCT ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT TCT TTG CTf ATA AGG CCA 
GTG GTA CCA AAT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 

CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCt AAT ACA GAT GCA 

o 

AAT CAG CTC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAt ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 
AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 
35 CAA GGT CIT CCT ACT AAT AAT ACA CM CCT TTT GTT ACA ACC ATT GTT 

GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 
GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 
ATA CGG GAA ACA GiT GTT AAG ATC CTC TCT TGT G6C CCT GCA TCC TCT GGC 
so CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 

TT(j GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 

ss 

TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 

or a nucleotide sequence which is equivalent to said sequence or portion by virtue of the degeneracy of the 
genetic code. 

60 >,u 3 ' A recombina , nt DNA molecule according to Claim 1 or Claim 2 characterised in that it includes at the 
*" terminal end DNA coding for a signal polypeptide having the nucleotide sequence 
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ATG TAT GCA GTG GCA ACA TGG CTT TGT TTT GCA 
TCC ACC TCA GGG TGG TCT TTC ACA TTA GAG 
GAT AAC AAC 

or a nucleotide sequence which is equivalent thereto by virtue of the degeneracy of the genetic code. 

4. A recombinant ON A molecule according to Claim 1, characterised in that it includes as an insert the 
following DNA sequence which codes for preproricin 

ATG TAT GCA GTG GCA ACA TGG CtT 

TGT TTI GGA TCC ACC TCA GGG TGG TCT TtC ACA TTA GAG GAT AAC AAC ATA 
TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 
CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 
GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA MC AGA GTT GGt TTG CCT 
ATA AAC CAA CGG TTT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC f AC CGT GCT 
GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 
ATC ACT CAT CtT TTC ACT GAT GTT CAA AAT CGA TAT ACA TtC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCT GGT AAT CTG AGA GAA AAT AtC 
GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC AC6 AGA ATT AGG tAC AAC CGG AGA TCT GCA CCA GAT CCT AGC 6TA ATT 
ACA CtT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCt ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TtT TCT TTG CTT ATA AGG CCA 

17 
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GTG GTA CCA AAT TTT AAT QCt GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 

CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 
AAT CAG CtC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CC6 GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CM ATA TGG GAT 
20 AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CT A GTT TTA GCA GCG ACA TCA 

GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 
CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTT ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 

30 

GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
35 TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 

ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT G6C CCT GCA TCC TCT GGC 
CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 

45 

TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 

so I'enriZcZto* 8eQUenCe " Weh is et » uiv8,ent t0 88id sequence or portion by virtue of the degeneracy of the 
bJ'iST ° N . A 0101801,18 according to any preceding claim characterised in that said insert has 
DNA molellle 8 9 V " Ct °' Wh ' Ch * * P ' aSmid ° r 8 D8 « 8 "'°P h8 S 8 » obtain said recombinant 

55 according"^ h ° 8t -".Gaining a recombinant DNA moiecu.e 

is E 7 Co)r^l^H^V?^'T accordin 9 ,0 a * im 6 < wh8 » appendant to Claim 4, wherein the host 
or pMB9 Characten88d In tnat the clon,n 9 V8Ctor 18 P'asmfd selected from pBR322, pAT153, pUC8. pGS15 

cell i&^SSmUS i micr ?? r fl an j sm according to Claim 6. when appendant to Claim 4, wherein the host 
and YEp6 Cemsve, characterised in that the cloning vector is pMa91, pMA230, YRp7, pLC544 

chJzrHJZZ?* ° f preparins ' cDNA h8vi "9 a nucleotide sequence as defined in eny of Cleims 1 t 4 
characterised by separating from a mixture of mRNA's obtained from Ricinus Communis tissue a 

55 lEEaSlT™ C ° nta 9 mRNA C ° di " 9 f ° r PrePf0riCin 8nd 8 y nth88i8i "9 cDNA from S,s by 
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10. A method of obtaining a recombinant DNA molecule by inserting a double-stranded cONA 
according to Claim 8 into a cloning vector. 

11. A method of obtaining a genetically*modified £ Coli or Sacchoromyces cervisiae host 
characterised in that a recombinant DNA molecule according to Claim 9 is introduced into said host. 

Patentanspruche 

1. Ein rekombinantes DNS-Molekui, dadurch gekennzeichnet daS es als Einfugung DNS enthalt, 
welche die A-Kette des Ricins kodiert und die folgende Nukleotid-Sequenz oder einen Teil davon besitzt, 
weiche ein Polypeptid kodiert, das die Ri bosom en-lnaktivitats-Eigenschaft der A-Kette des Ricins zeigt: 

ATA TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 



CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 
GCT GAT GTG AGA CAT GAT ATA CCA 6TG TIG CCA AAC AGA GTT GGt TTG CCT 

20 

ATA AAC CAA CGG TtT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 
25 GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC TAC CGT GCT 

GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC MT CAG GAA GAt GCA GAA GCA 
ATC ACT CAT CtT TTC ACT GAf GTT CAA AAT CGA TAT ACA TtC GCC TTT GGT 
GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 

35 

AGT ACT G6T GGC ACT CAG CTT CCA ACT CTG GCT CGT TfcC TTT ATA ATf TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 

40 

CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC 6TA ATT 
45 ACA CtT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GA(j TCT AAC 

CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 



TTC AGT GTG TAC 6AT GTG AGT AtA TTA ATC CCt ATC ATA GCT CTC ATG GTG 

TAT AGA TGC GCA CCT CCA CCA f CG TCA CAG TtT 

oder eine Nukleotid-Sequenz, die wegen der Degenerierung des genetischen Kodes dieser Sequenz oder 
diesem Teil Equivalent ist 

2. Rekombinantes DNS-Molekul nach Anspruch 1, dadurch gekennzeichnet, daft es als Einfugung DNS 
enthalt die Ricins kodiert und die folgende Nukleotid-Sequenz oder einen Teil davon besitzt, welche ein 
Protein kodiert, das die Zeil-Bindungs- und Ribosomen-lnaktivitats-Eigenschaft des Ricins zeigt: 

ATA TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 



65 CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 
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GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTG CCT 

ATA AAC CAA CGG TlT ATT TTA GTT GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTt ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCT 
G6A AAT AGC GCA TAT TTC TTT CAT CCT GAC MT CAG GAA GAT GCA GAA GCA 
ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TtC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCT GGT AAT CTG AGA GAA AAT ATC 
GAG TTG GGA AAT GGT CCA CTA GAG GAG GtT ATC TCA GCG CTT TAT TAT TAC 
ACT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCt ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT TCT TTG CTT ATA AGG CCA 
GTG GTA CCA AAT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTt GAT GTT AGG GAT GGA AGA TtC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCt AAT ACA GAT GCA 
AAT CAG CtC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CC6 GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 
AAT GGA ACC AtC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 
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CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTt ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 

5 

GAC TGT AGC AGT GAA AAG GCT GAA CM CAG TGG 6CT CTT TAT GCA GAT GGT 
10 TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTt ACA AGT GAT f CT AAT 

ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 
CAA CGA TGG ATG TfC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
TAC CCT CTC CAt GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 

25 Sfe^eJfCfviSn^" 2 ' * W6gen * De 9 enerierun 9 *. genetischen Kodes dieser Sequenz oder 
3. R ek ombinantes DNS-Molekul nach Anspruch 1 oder 2, dadurch gekennzeichnet, daS es am NH,- 
terminalen Ende DNS enthalt, welche eine Signal-Polypeptid kodiert, welches die Nukleotid-Sequenz 

30 ATG TAT GCA GTG GCA ACA TGG CTT TGT TTT GCA 

TCC ACC TCA GGG TGG TCT TTC ACA TTA GAG 

35 GAT AAC AAC 

beste oder eine Nukleotid-Sequenz, die wegen der Degenerierung des genetischen Kodes dazu equivalent 

ATG TAT GCA GTG GCA ACA TGG CTT 

TGT TIT GGA TCC ACC TCA GGG TGG TCT TTC ACA TTA GAG GAT AAC AAC ATA 
so TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGT CGT TIA ACA ACT GGA 
GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA MC AGA GTT GGT TTG CCT 



60 



ATA AAC CAA CGG TTT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTt ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC TAC CGT GCT 
6S GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 
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ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TtC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 
AGT ACT GGT GGC ACT CAG- CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 
ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC AC6 AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCf ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT TCT TTG CTf ATA AGG CCA 
GTG GTA CCA AAT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TTC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT 6CA 
AAT CAG CTC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CC6 GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 
AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
GGG AAC AGT GGT ACC ACA CTT ACG GTG CAA ACC AAC ATT TAT GCC GTT AGT 
CAA GGT TGG CTT CCT ACT AAT AAT ACA CAA CCT TTT GTt ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 
GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 
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ATA CGG gaa aca gtt gtt aag atc Ctc tct tgt g6c cct gca tcC tct ggc 



CAA CGA TGG ATG Tf C AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 



TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 



TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TfiG TTA CCA TTA TTT 

Oder eine Nukleotid-Sequenz, die wegen der Degenerierung des genetischen Kodes dieser Sequenz Oder 
diesem Teii equivalent ist. 

15 5. Rekombinantes DNS-Molekui nach einem der vorhergehenden Anspruche, dadurch gekennzeichnet, 
dafi die Einfugung in einen Klonierungs-Zwischentrager, der ein Piasmid oder ein Bakteriophage ist, 
eingefuhrt wurde, urn das rekombinante DNS-Molekui zu erhalten. 

6. Modifizierte E. Coii oder Saccharomyces Cervisiae-Wirtszelle, weiche ein rekombinantes DNS- 
Molekui nach einem der vorhergehenden Anspruche enthilt. 

20 7. Ein modifizierter Wirts-Mikroorganismus nach Anspruch 6 und 4, wobei die Wirts-Zelle E. Coii ist, 
dadurch gekennzeichnet, da& der Klonierungs-Zwischentrager ein Piasmid, ausgewahlt aus pBR322, 
pAT153, pUC8 f pGSl5 oder pMB9 ist 

8. Ein modifizierter Wirts-Mikroorganismus nach Anspruch 6 und 4, wobei die Wirts-Zelle 
Saccharomyces Cervisiae ist, dadurch gekennzeichnet daB der Klonierungs-Zwischentrager pMA91, 

2s pMA230, YRp7, pLC544 and YEp6 ist. 

9. Methode zur Darstellung von cDNS mit emer Nukleotid-Sequenz wie in einem der Anspruche 1 bis 4 
definiert, gekennzeichnet durch Abtrennen einer Molekular-gewichtsfraktion, die Preproricin kodierende 
mRNS enthalt aus einer Mischung von aus Ricinus Communis-Gewebe erhaltenen mRNS und durch 
Synthetisieren von cDNS aus dieser mittels reverser Transkription. 

30 10. Method, ein rekombinantes DNS-Molekui durch Einsetzen einer doppelstrengigen cDNS nach 
Anspruch 8 in einen Klonierungs-Zwischentrager zu erhalten. 

1 1 . Methode zur Erhaltung eines genetisch modifizierten E. Coii- Oder Saccharomyces Cervisiae-Wirts, 
dadurch gekennzeichent, daB ein rekombinantes DNS-Molekui nach Anspruch 9 in diesen Wirt eingefuhrt 
wird. 

35 

Revendieations 

1. Molecule d'ADN recombinant caracterisee en ce qu'eile comprend en tant que segment d'insertion 
un ADN codant pour la chaine A de la ricine et ayant ia sequence nucl^otidique ci-dessus ou une partie de 
40 celle-ci codant pour un polypeptide qui manifeste la propriete d'inactivation ribosomale de la chaine A de la 
ricine: 

ATA 



TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 



CAA AGC TAC ACA AAC TTT ATC AGA GtT GTT CGC GGT CGT T1A ACA ACT GGA 
GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTG CCT 



ATA AAC CAA CGG TtT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 
GTt ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC TAC CGT GCT 



G6A AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAt GCA GAA GCA 



ATC ACT CAT CtT TTC ACT GAt GTT CAA AAT CGA TAT ACA TTC GCC TTT GGT 
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GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 



GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 



ACT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 



ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 



CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 

J5 

ACA CTT GAG AAT AGf TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 



CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT GGT TCC AAA 



TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCf ATC ATA GCT CTC ATG GTG 



TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TtT 

ou une sequence nucelotidique qui est Squivalente d tadite sequence ou partie de la sequence en raison de 
la degenescence du code gen6tique. 
30 2. Molecule d'ADN recombinant selon fa revendication 1, caracterisSe en ce qu'elle comprend en tant 
que segment d'insertion un ADN codant pour la ricine et ayant la sequence nucteotidique ci-dessous ou 
une portion de celie-ci codant pour un proline qui manifeste la propria de liaison aux cellules et 
deactivation ribosomaie de la ricine: 

35 ATA 



TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 

40 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGT CGT TIA ACA ACT GGA 



GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA AAC AGA GTT GGT TTG CCT 

45 

ATA AAC CAA CG G TTT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 



50 GTT ACA TTA GCC CTG GAT GTC ACC AAT GCA TAT GTG GTC GGC TAC CGT GCT 



GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAT GCA GAA GCA 

55 

ATC ACT CAT CTT TTC ACT G AT GTT CAA AAT CGA TAT ACA Tf C GCC TTT GGT 



GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 

60 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GCG CTT TAT TAT TAC 



AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATt TGC 
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ATC CAA ATG ATT TCA GAA GCA CCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 
CGC ACG AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC GTA ATT 
ACA CTT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 
CAA GGA GCC TTT GCT AGT CCA ATt CAA CTG CAA AGA CGT AAT GGT TCC AAA 
TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCT ATC ATA GCT CTC ATG GTG 
TAT AGA TGC GCA CCT CCA CCA TCG TCA CAG TTT TCT TTG CTT ATA AGG CCA 
GTG GTA CCA AAT TTT AAT GCT GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 
CGT ATC GTA GGT CGA AAT GGT CTA TGT GTT GAT GTT AGG GAT GGA AGA TtC 
CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCT AAT ACA GAT GCA 
AAT CAG CtC TGG ACT TTG AAA AGA GAC AAT ACT ATT CGA TCT AAT GGA AAG 
TGT TTA ACT ACT TAC GGG TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 
TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CM ATA TGG GAT 
AAT GGA ACC ATC ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 
GGG AAC AGT GGT ACC ACA CTT ACG GTG CM ACC AAC ATT TAT GCC GTT AGT 
CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTt ACA ACC ATT GTT 
GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 
GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 
TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTT ACA AGT GAT TCT AAT 
ATA CGG GAA ACA GTT GTT AAG ATC CTC TCT TGT GGC CCT GCA TCC TCT GGC 
CAA CGA TGG ATG TTC AAG AAT GAT GGA ACC ATT TTA AAT TTG TAT AGT GGA 
TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 
TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 
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ou une sequence nucl£otidique qui st equivalente a ladite sequence ou partie de la sequence ou partie de 
celle-ci en raison de la deg^nescence du code genettque. 

3. Molecule d'ADN recombinant seion la revendication 1 ou la revendication 2, caracterisee en ce 
qu'ell compr nd a 1'extremite NH r terminale un ADN codant pour un polypeptide de signal ayant la 
sequence nucieoiditique: 

ATG TAT GCA GTG GCA ACA TGG CTT TGT TTT GCA 
TCC ACC TCA GGG TGG TCT TTC ACA TTA GAG 
GAT AAC AAC 

ou une sequence nucteotidique qui est equivalente a celle-ci en raison de la degenescence du code 
g6n6tique. 

4. Molecule d'ADN recombinant selon ia revendication 1, caracteris6e en ce qu'elle comprend en tant 
segment d'insertion la sequence d'ADN suivante qui code pour la preproricine: 

ATG TAT GCA GTG GCA ACA TGG CtT 

TGT TIT GGA TCC ACC TCA GGG TGG TCT TfC ACA TTA GAG GAT AAC AAC ATA 

TTC CCC AAA CAA TAC CCA ATT ATA AAC TTT ACC ACA GCG GGI GCC ACT GTG 

CAA AGC TAC ACA AAC TTT ATC AGA GCT GTT CGC GGt CGT TIA ACA ACT GGA 

GCT GAT GTG AGA CAT GAT ATA CCA GTG TIG CCA MC AGA GTT GGt TTG CCT 

ATA AAC CAA CGG TtT ATT TTA GTt GAA CTC TCA AAT CAT GCA GAG CTT TCT 

GTt ACA TTA GCC CTG GAT GTC ACC AAT GCA TAt GTG GTC GGC tAC CGT GCT 

GGA AAT AGC GCA TAT TTC TTT CAT CCT GAC AAT CAG GAA GAt GCA GAA GCA 

ATC ACT CAT CTT TTC ACT GAT GTT CAA AAT CGA TAT ACA TfC GCC TTT GGT 
GGT AAT TAT GAT AGA CTT GAA CAA CTT GCt GGT AAT CTG AGA GAA AAT ATC 

GAG TTG GGA AAT GGT CCA CTA GAG GAG GCT ATC TCA GC& CTT TAT TAT TAC 

AGT ACT GGT GGC ACT CAG CTT CCA ACT CTG GCT CGT TCC TTT ATA ATT TGC 

ATC CAA ATG ATT TCA GAA GCA GCA AGA TTC CAA TAT ATT GAG GGA GAA ATG 

CGC AC& AGA ATT AGG TAC AAC CGG AGA TCT GCA CCA GAT CCT AGC 6TA ATT 

ACA CfT GAG AAT AGT TGG GGG AGA CTT TCC ACT GCA ATT CAA GAG TCT AAC 

CAA GGA GCC TTT GCT AGT CCA ATT CAA CTG CAA AGA CGT AAT G&T TCC AAA 



26 



15 



20 



EP 0145111 B1 

TTC AGT GTG TAC GAT GTG AGT ATA TTA ATC CCt ATC ATA GCT CTC ATG GTG 

5 TAT AGA TGC GCA CCT CCA CCA tCG TCA CAG TtT TCT TTG CTT ATA AGG CCA 

GTG GTA CCA AAT TTT AAT GCt GAT GTT TGT ATG GAT CCT GAG CCC ATA GTG 

io CGT ATC GTA GGT CGA AAT GGT CTA TGT GTt GAT GTT AGG GAT GGA AGA TtC 

CAC AAC GGA AAC GCA ATA CAG TTG TGG CCA TGC AAG TCt AAT ACA GAT GCA 

AAT CAG CtC TGG ACT TTG AAA AGA GAC AAT ACT ATT C&A TCT AAT GGA AAG 

TGT TTA ACT ACT TAC GGG TAC AGT CCG GGA GTC TAT GTG ATG ATC TAT GAT 

TGC AAT ACT GCT GCA ACT GAT GCC ACC CGC TGG CAA ATA TGG GAT 

25 AAT GGA ACC At C ATA AAT CCC AGA TCT AGT CTA GTT TTA GCA GCG ACA TCA 

GGG AAC AGT GGT ACC ACA CtT ACG GTG CAA ACC AAC ATT tAT GCC GTT AGT 

CAA GGT TGG CIT CCT ACT AAT AAT ACA CAA CCT TTT GTt ACA ACC ATT GTT 

GGG CTA TAT GGT CTG TGC TTG CAA GCA AAT AGT GGA CAA GTA TGG ATA GAG 

GAC TGT AGC AGT GAA AAG GCT GAA CAA CAG TGG GCT CTT TAT GCA GAT GGT 

40 TCA ATA CGT CCT CAG CAA AAC CGA GAT AAT TGC CTf ACA AGT GAT tCT AAT 

ATA CGG GAA ACA GlT GTT AAG ATC CTC TCT TGT G6C CCT GCA TCC TCT GGC 

CAA CGA TGG ATG TtC AAG AAT GAT GGA ACC ATT fTA AAT TTG TAT AGT GGA 

TTG GTG TTA GAT GTG AGG CGA TCG GAT CCG AGC CTT AAA CAA ATC ATT CTT 

TAC CCT CTC CAT GGT GAC CCA AAC CAA ATA TGG TTA CCA TTA TTT 

55 ou une sequence nucleotidique qui est equivalente a ladite sequence ou partie de celle-ci en raison de la 

deg^nescence du code g6netique. 

5. Mol6cule d'ADN recombinant selon I'une quelconque des revendications pr6c6dentes, caracterisde 

en ce que (edit segment d'insertion a ete introduit dans un vecteur de cionage qui est un piamide ou un 

bacteriophage pour obtenir ladite molecule d'ADN recombinant. 
50 6. Cellule hdte modifiee d'ECo// ou de Saccharomyces Cervisiae contenant une molecule d'ADN 

recombinant selon I'une quelconque des revendications pr6c6dentes. 

7. Microorganisms hdte modifie selon la revendication 6, lorsqu'elle est d6pendante de la 

revendication 4, dans laquelle la cellule h6te est £ coli, caract6ris6 en ce que I vecteur de cionage est un 

plasmide choisi parmi pBR322, pAT153, pUC8, pGS15 ou pMB. 
55 8.- Microorganisme hdte m difie selon la revendication 6, lorsqu'elle st d6pendante de la 
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revendication 4, dans laquelle la cellule hdte est Sacchoromyces Cervisiae, caracterisg en ce que le vecteur 
d clonage est pMa91, pMA230, YRp7, pLC544 et YEp6. 

9. Proc6d§ de preparation d'ADNc ayant une sequence nucleotldique telle que ddfinie dans Tune 
quelconque des revendications 1 a 4, caract6ris6 en ce que Ton separe a partir d'un melange d'ARNm 

5 obtenu a partir de tissu de Ricinus Communis une fraction de poids mol6culaire contenant I'ARNm codant 
pour la prdproricine et en ce que Ton synthase I'ADNc a partir de celui-ci par transcription inverse. 

10. Proc6de d'obtention d'une molecule d'ADN recombinant par insertion d'un ADNc double brin selon 
la revendication 8 dans un vecteur de clonage. 

11. Proc6d6 d'obtention d'un hdte g6n6tiquement modifte d'E coii ou Sacchoromyces cervisiae, 
w caracteris6 en ce que Ton introduit une molecule d'ADN recombinant selon la revendication 9 dans ledit 

hdte. 
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